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(57) Abstract 

A eukaryotic cell transformed with a DNA construct comprising a coding sequence encoding a polypeptide having the ac- 
tivity of a mammalian phospholipid kinase or a mammalian protein kinase activated by a phospholipid or its metabolite and re- 
gulatory elements to allow transcription of the coding sequence in the said cell wherein the regulatory elements include a repressi- 
ble or inducible promoter and the expression of the said coding sequence is lethal or growth inhibitory to the cell. In a preferred 
embodiment the cells are Schizosaccharomyce pombe. The cells are used as the basis of an assay for compounds involved in cell 
growth regulation. Such compounds can be used to treat cancers and the formation of blood vessel plaques. 
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EUCARYOTIC CELLS TRANSFOf^^ WITH A IWMALIAN PHOSPHOLIPID KINASE 
OR PROTEIN KINASE AND ASSAYS USING THEM 

The present invention relates to assays for compounds involved in cell growth 
regulation, and more particularly to those involved in transducing signals from 
5 hormones, growth factors and oncogenes. Such compounds represent potential 
drugs or targets for drugs to treat cancers and to prevent the formation of 
plaques which cause heart disease. 

Phosphatidylinositol 3-OH kinase (Ptdlns 3-kinase) catalyses the 
10 phosphorylation of the 3 -hydroxy 1 of inositol in Ptdlns, PtdIns-4-phosphate or 
in PtdIns-4,5-bisphosphate. This activity is involved in transducing signals 
from a number of hormones, growth factors and oncogenes. The standard 
assay for the activity of the Ptdlns 3-kinase involving lipid moieties does not 
readily lend itself to a screen for potential inhibitors (or activators) of catalytic 
15 function. Members of the protein kinase C family of enzymes are involved in 
transducing signals from a number of hormones, growth factors and oncogenes. 
The standard assay for protein kinase C does not lend itself to a screen for 
potential inhibitors (or activators) of catalytic function. 

20 Thus it has been desirable to investigate other means of searching for 
inhibitors. 

A first aspect of the invention provides a eukaryotic cell transformed with a 
DNA construct comprising a coding sequence encoding a polypeptide having 
25 the activity of a mammalian phospholipid kinase or a mammalian protein kinase 
activated by a phospholipid or its metabolite and regulatory elements to allow 
transcription of the coding sequence in the said cell wherein the regulatory 
elements include a repressible or inducible promoter and the expression of the 
said coding sequence is lethal or growth inhibitory to the cell. 

30 
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Polypeptides having the activity of a phospholipid kinase or a protein kinase 
activated by a phospholipid or its metabolite are involved in cell growth 
regulation. 

5 By "growth inhibitory" we mean that the growth rate of cells transformed with 
the said DNA construct is at least two to three fold lower than the same cells 
not transformed with the said DNA construct when grown in the same culture 
conditions. 

10 By "repressible" we mean that in the presence of a repressing agent the 
expression from the promoter is at least two-fold lower than expression from 
the promoter in the absence of the repressing agent. 

It is preferred if expression from the promoter in the presence of a repressing 
15 agent is at least five-fold lower, more preferably ten-fold lower or even more 
preferably 100-fold lower than expression from the promoter in the absence of 
the repressing agent. 

By "inducible" we mean that in the presence of an inducing agent the 
20 expression from the promoter is at least two-fold higher than expression from 
the promoter in the absence of the inducing agent. 

It is preferred if expression from the promoter in the presence of an inducing 
agent is at least five-fold higher, more preferably ten-fold higher or even more 
25 preferably 100-fold higher than expression from the promoter in the absence of 
the inducing agent. 

When an inducible promoter is used there is sufficiently low expression of the 
polypeptide in the uninduced state that the lethal or growth inhibitory phenotype 
30 is not observed whereas when the inducing agent is present the lethal or growth 
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inhibitory phenotype is observed. 

When a repressible promoter is used there is sufficiently low expression of the 
polypeptide in the repressed state that the lethal or growth inhibitory phenotype 
5 is not observed whereas when the repressing agent is absent the lethal or 
growth inhibitory phenotype is observed. 

Suitable eukaryodc cells include mammalian cells, such as COS cells and CHO 
cells, insect cells, slime mould such as Dictyosteliwn, and yeast. 

10 

Suitable regulatable mammalian cell promoters include glucocorticoid-inducible 
promoters and the metallothionein promoter: 

It is preferred if the cell is a yeast cell. 

15 ^ 
Exemplary genera of yeast contemplated to be useful in the practice of the 
present invention are Pichia, Saccharomyces, Kluyveromyces, Candida, 
Torulopsis, Hansenula, Schizosaccharomyces, Citeromyces, Pachysolen, 
Debaromyces, Metschumkowia, Rhodosporidium, Leucosporidiwn, Botryoascus, 

20 Sporidiobolus, Endomycopsis, and the like. 

It is preferred if the yeast is a fission yeast. 

It is further preferred if the yeast is Schizosaccharomyces. 

25 

Preferably, the said polypeptide has the activity of a phospholipid kinase, for 
example a catalytically effective portion of the said kinase. Phospholipid 
kinases include phosphatidyl inositol 3-kinase, phosphatidyl inositol 4-kinase 
and phosphatidyl inositol 5-kinase which phosphorylate the inositol ring on the 
30 3\ 4' or 5' hydroxyl, respectively. 
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Suitably, the said polypeptide is a catalytically effective portion of a 
phosphatidylinositol 3-OH kinase. It is convenient to use the 110 kDa 
mammalian Ptdlns 3-kinase catalytic subunit. 

5 In further preference, the said polypeptide is a catalytically effective portion of 
a protein kinase C (PKC). Suitably, the protein kinase C is PKC-7 or PKC-5 
or PKC-t; or PKC-€. 

A constitutive promoter such as adh may be used (disclosed in ref 1). Also, 
10 the SV40 promoter may be used. 

Thus, a further aspect of the invention provides a Schizosaccharomyces cell 
transformed with a DNA construct comprising a coding sequence encoding a 
polypeptide having the activity of a mammalian phospholipid kinase or a 
15 mammalian protein kinase activated by a phospholipid or its metabolite and 
regulatory elements to allow transcription of the coding sequence in the said 
cell wherein the regulatory elements include a constitutive promoter and the 
expression of the said coding sequence is lethal or growth inhibitory to the cell. 

20 Any gene that arrests growth or is lethal can be expressed only transiently for 
the purposes of subsequent inhibitor screening. In the case of a constitutive 
promoter in a plasmid carrying a marker, freshly transfected cells are diluted 
directly into medium using a combination of growth conditions to select for 
transfectants (for example, medium containing no leucine) and added potential 

25 inhibitors of the constitutively expressed mammalian gene to test for their 
efficacy. 

Mammalian genes whose expression can be controlled by growth conditions can 
be introduced into the yeast under conditions where expression is low (ie 
30 suppressed or not induced) . 
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It is preferred if the mammalian genes so introduced are stably maintained in 
the yeast. 

It is further preferred if the mammalian genes are stably integrated into the 
5 yeast genome. 

Expression is then increased following growth under de-repressing conditions 
(for example removal of thiamine) and potential inhibitors scored on their 
ability to permit growth under these conditions. The use of an integrant and 
a controllable promoter provides the most amenable procedure. The induction 
of cell arrest or cell death provides a powerful screen for a suppressor of such 
events. The present invention provides a screen for suppressors of regulatory 
proteins that control other mammalian functions either directly, for example 
protein kinases, or indirectly through the production of small regulatory 
molecules, for example an inositol lipid kinase. ? 

Thus, in a preferred embodiment, the S. pombe cells contain a coding sequence 
for the 110 kDa mammalian Ptdlns 3-kinase catalytic subunit under the 
regulatory control of the nmt promoter and with other suitable regulatory 
20 elements, such as a transcription terminator, as is known in the art, for 
expression of the said catalytic subunit. In the presence of thiamine the 
promoter is inoperative and the cells carrying the Ptdlns 3-kinase catalytic 
subunit plasmid grow as the parental strain. (It will be appreciated by those 
skilled in the art that the parental strain may not be wild-type. For example 
25 mutant strains containing Ade" or Leu" or Ura' mutations may be used as the 
parental strain to allow selection of plasmid uptake). In the absence of 
thiamine the nmt promoter functions and the Ptdlns 3-kinase catalytic subunit 
is induced. This has been shown by demonstrating a substantial increase in 
Ptdlns 3-kinase activity under these conditions. However, following this 
30 induction the cells cease to divide; cultures plated in the absence of thiamine 



10 



15 
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do not grow but die. 

Derivative of the nmt promoter that retain the thiamine-repressibility 
characteristics of the wild type promoter may also be used. 

5 

As an alternative to the thiamine-repressible nmt promoter, the fbpl gene 
promoter from S. pombe can be used. The fbpl gene promoter is repressed in 
the presence of 8% glucose as disclosed by Hoffman & Winston (1990) 
Genetics 124, 807-816 incorporated herein by reference. Thus, in a further 

10 embodiment, the S. pombe cells contain a coding sequence for the 110 kDa 
mammalian Ptdlns 3-kinase catalytic subunit under the regulatory control of the 
fbpl promoter and with other suitable regulatory elements for expression of the 
said catalytic subunit. In the presence of 8% glucose the function of the 
promoter is repressed and the cells carrying the Ptdlns 3-kinase catalytic 

15 subunit plasmid grow on the parental strains. In the absence of glucose the 
fbpl promoter functions and the Ptdlns 3-kinase catalytic subunit is induced. 

The lethal phenotype of the 5. pombe expressing mammalian Ptdlns 3-kinase 
provides a very powerful tool with which to screen for inhibitors of this 

20 activity. Cells plated in the absence of thiamine will survive and proliferate if 
the activity of the Ptdlns 3-kinase is suppressed, A direct demonstration that 
this is indeed the case, is afforded by the finding that a mammalian Ptdlns 3- 
kinase regulatory subunit (p85a) when coexpressed with the Ptdlns 3-kinase 
catalytic subunit will rescue these cells and allow proliferation. Clearly, 

25 therefore, coexpression of (or generally the presence of) the p85Qr subunit 
should be avoided in the assay of this embodiment, as should, in other 
embodiments, other activity-suppressing compounds. 

In further embodiments the S. pombe cells contain a coding sequence for a 
30 mammalian protein kinase C under the regulatory control of the nmt promoter 
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or the Jbpl promoter. 

As an inhibitor screening process, a further advantage afforded by this 
approach is that general cytostatic and cytotoxic compounds will score negative; 
5 the screen will distinguish the action of the mammalian Ptdlns 3-kinase or 
protein kinase C against the background of a plethora of essential eukaryotic 
gene functions. 

Thus, a further aspect of the invention provides an assay kit comprising a 
10 eukaryotic cell according to the first aspect of the invention and culture medium 
such that the cell will divide and grow and such that the said coding sequence 
is expressed, the expressed polypeptide at least preventing cell division in the 
cell culture, 

15 Conveniently the kit comprises S. pombe as the eukaryotic cell. 

The invention also encompasses compounds identified as being useful in the 
assays of the invention. 

20 These compounds are useful in the treatment of disease and medical conditions 
where there is an undesirable fiinction of a phospholipid kinase or a protein 
kinase activated by a phospholipid or its metabolite. 

Such diseases and conditions include cancer, inflammation, Alzheimer's 
25 disease, restenosis, atherosclerosis and wound healing. 

Suitable promoters and coding sequence can be incorporated into vectors in the 
correct orientation by methods known in the art, some of which are described 
in Sambrook et al (1989) Molecular Cloning, a practical approach (2nd 
30 Edition), Sambrook, J., Fritsch, E. & Maniatis, T., eds. Cold Spring Harbor 
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Laboratory Press, Cold Spring Harbor, New York, incorporated herein by 
reference. 

A variety of methods have been developed to operatively link DNA to vectors 
5 via complementary cohesive termini. For instance, complementary 
homopolymer tracts can be added to the DNA segment to be inserted to the 
vector DNA. The vector and DNA segment are then joined by hydrogen 
bonding between the complementary homopolymeric tails to form recombinant 
DNA molecules. 

10 

Synthetic linkers containing one or more restriction sites provide an alternative 
method of joining the DNA segment to vectors. The DNA segment, generated 
by endonuclease restriction digestion as described earlier, is treated with 
bacteriophage T4 DNA polymerase or E. coli DNA polymerase I, enzymes that 
15 remove protruding, 3 '-single-stranded termini with their 3'-5'-exonucleolytic 
activities, and fill in recessed 3 '-ends with their polymerizing activities. 

The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a large molar 

20 excess of linker molecules in the presence of an enzyme that is able to catalyze 
the ligation of blunt-ended DNA molecules, such as bacteriophage T4 DNA 
ligase. Thus, the products of the reaction are DNA segments carrying 
polymeric linker sequences at their ends. These DNA segments are then 
cleaved with the appropriate restriction enzyme and ligated to an expression 

25 vector that has been cleaved with an enzyme that produces termini compatible 
with those of the DNA segment. 

Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
30 Biotechnologies Inc, New Haven, CN, USA. 
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A desirable way to modify the DNA encoding the polypeptide of the invention 
is to use the polymerase chain reaction as disclosed by Saiki et al (1988) 
Science 239, 487-491. 

5 In this method the DNA to be enzymatically amplified is flanked by two 
specific oligonucleotide primers which themselves become incorporated into the 
amplified DNA. The said specific primers may contain restriction 
endonuclease recognition sites which can be used for cloning into expression 
vectors using methods known in the art, 

10 

Transformation of appropriate cell hosts is accomplished by well known 
methods that typically depend on the type of vector used and host^cell. 
Transformation of Saccharomyces and related cells is described in Sherman et 
al (1986) Methods In Yeast Genetics, A Laboratory Manual, Cold Spring 
15 Harbor, NY. The method of Beggs (1978) Nature 275, 104-109 is also useful. 
With regard to vertebrate cells, reagents useful in transfecting such cells, for 
example calcium phosphate and DEAE-dextran or liposome formulations, are 
available from Stratagene Cloning Systems, or Life Technologies Inc., 
Gaithersburg, MD 20877, USA. 

20 

Schizosaccharomyces pombe may be transformed following LiCl treatment or 
by electroporation. 

Conveniently, a Bio-Rad Pulse Controller may be used for electroporation of 
25 S. pombe cells. 

a) Grow up cells to OD595 less than or equal to 0.5 in minimal medium. 

b) Centrifuge cells at 1500 g for 5 min, remove supernatant and resuspend 
30 in 20 ml ice-cold distilled water, centrifuge again, remove supernatant and 
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resuspend in 20 ml ice-cold 1 M sorbitol, centrifuge again and remove 
supernatant, 

c) Resuspend cells in ice-cold 1 M sorbitol to a density of —5x10^ 
5 cells/ml (concentrated 500 times when compared to original culture). 

d) Use 40-100 /xl of cell suspension per transformation. Add DNA (up to 
100 ng) in 1 ^1 in TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0) to cells and 
incubate on ice 5 min. 

10 

e) Transfer to pre-chilled cuvettes (0.2 cm gap) and apply pulse (1.5 KV, 
25 /xF, 200 Q). 

f) Immediately add 900 p\ of ice-cold 1 M sorbitol and transfer to a 
15 chilled tube on ice. 

g) Promptly spread 100-200 ^1 onto a selective minimal medium plate 
containing 1 M sorbitol and culture at 32°C until grown. 

20 The technique of electroporation of yeast is disclosed in Becker, D.M. and 
Guarente, L. (1990) Meth, EnzymoL 194, 182. 

Machines for electroporation are available from other manufacturers and can 
be used to transform yeast and mammalian cells according to their instructions. 

25 

Successfully transformed cells, ie cells that contain a DNA construct, can be 
identified by well known techniques. For example, cells resulting from the 
introduction of an expression construct of the present invention can be grown 
to produce the polypeptide of the invention. Cells can be harvested and lysed 
30 and their DNA content examined for the presence of the DNA using a method 
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such as that described by Southern (1975) J. MoL BioL 98, 503 or Berent et 
al (1985) Biotech. 3, 208. 

In addition to directly assaying for the presence of recombinant DNA, 
5 successful transformation can be confirmed by well known immunological 
methods when the recombinant DNA is capable of directing the expression of 
the protein. For example, cells successfiiUy transformed with an expression 
vector produce proteins displaying appropriate antigenicity. Samples of cells 
suspected of being transformed are harvested and assayed for the protein using 
10 suitable antibodies, for example by western blotting. 

The invention will now be described in detail with reference to the following 
Examples and Figures wherein: 

15 Figure 1 shows the nucleotide sequence (SEQ ID No 1) and deduced amino 
acid (SEQ ID No 2) of the sequence 110 kDa catalytic subunit of Ptdlns 3- 
kinase (PllO). 

Figure 2 shows the nucleotide sequence of the nmt promoter region (SEQ ID 
20 No 3). 

Figure 3 shows the nucleotide sequence of PKC-e (SEQ ID No 4). 
Figure 4 shows the nucleotide, sequence of PKC-7 (SEQ ID No 5). 

25 

Figure 5 shows the nucleotide sequence of PKC-6 (SEQ ID No 6). 
Figure 6 shows the nucleotide sequence of PKC-t; (SEQ ID No 7). 
30 Figure 7 shows that the lethal effect of pi 10 expression in 5. pombe is 
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suppressed by p85 expression. 

Figure 8 shows the isotype-specific effects of PKC expression in S. pombe. 

5 Figure 9 shows the effect of PKC expression on growth rates in liquid culture. 

Figure 10 shows that PKC-6-induced growth inhibition is the result of kinase 
activity. 

10 Example 1: Assay using catalytic subunit of Ptdlns 3-kinase and nmt 
promoter 

Isolation of Ptdlns 3-kinase catalytic subunit cDNA. The cDNA for the 110 
kDa catalytic subunit can be isolated by a conventional cloning strategy. 

15 Purification of the bovine enzyme from brain tissue (Morgan, Smith et al 1990) 
has demonstrated that sufficient protein can be isolated for protein sequence 
determination. This is unequivocally established for the 85 kDa regulatory 
subunit which has been sequenced from this source and, as a consequence, 
cloned (Otsu, Hiles et al 1991). The Ptdlns 3-kinase from bovine brain (85- 

20 110 dimer) is purified according to Morgan, Smith et al (1990) by sequential 
fractionation with ammonium sulphate and chromatography on DEAE-cellulose, 
phosphocellulose, Sephacryl S-200 and Mono Q. In order to remove 
contaminants and separate subunits, the protein is further purified by sodium 
dodecyl sulphate polyacrylamide gel electrophoresis according to Laemmli 

25 (1970), the 110 kDa protein visualised in ammonium chloride (4N), 
electroeluted and digested with trypsin as described in Katan, Kriz et al (1988). 
Tryptic peptides are then separated by standard procedures and subjected to 
amino acid sequence determination. Sequence established for the 110 kDa 
catalytic subunit is used to predict redundant oligonucleotide probes for 

30 screening a bovine brain cDNA library. Standard cloning procedures are then 
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employed in the isolation of a cDNA encoding the complete open reading frame 
of the 110 kDa subunit (Sambrook et al 1989). The sequence of the cDNA is 
determined by commonly employed dideoxy-sequencing procedures. A specific 
example of using this strategy is described by Hiles et al (1992) Cell 70, 419- 
5 429. 

Materials: Restriction enzymes and DNA modification enzymes were obtained 
from standard commercial sources and used according to the manufacturer's 
recommendations. Oligonucleotides were synthesised on an Applied 
10 Biosystems 380B DNA synthesiser and used directly in subsequent procedures. 

Protein Purification and Amino Acid Sequence Determination: The 

purification of the pSSor and pi 10 proteins by chromatography on a peptide 
affinity column corresponding to amino acids 742-758 of the kinase insert 

15 region of the human PDGF-jS receptor has been described (Otsu et flA:(1991) 
Cell 65, 91-104). Proteins were released from the affinity matrix using SDS- 
containing buffers, separated on a Prosieve agarose gel, and visualised by 
staining with Coomassie blue G250. The band corresponding to pi 10 was 
excised and protein was eluted by tube gel HPEC. Protein was precipitated 

20 from pi 10-containing fractions by treatment with trichloroacetic acid and then 
washed with acetone. The pi 10-containing pellet was resuspended and digested 
with lysylendopeptidase in the presence of SDS, and peptides were separated 
by tandem ion-exchange chromatography and reverse-phase HPLC. This 
procedure was carried out on three separate PI3-kinase preparations. A fourth 

25 preparation was eluted from the matrix as before and boiled for 5 min. After 
cooling, the sample was diluted with 25 mM Tris-HCl (pH 8.8) and digested 
directly with lysylendopeptidase for 72 hr at SO^^C. Peptides were separated 
as above. Peptide sequences were determined using a modified Applied 
Biosystems 477A automated pulse-liquid sequencer. 



30 
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mRNA Isolation and cDNA Cloning: Total RNA was isolated from SGBAF-1 
cells by the method of Chirgwin et al (1979) Biochemistry 18, 294-299 and 
poly(A) mRNA was selected by chromatography on oligo-(dT)-cellulose 
(Maniatis et al (1982) Molecular Cloning: A laboratory manual. Cold Spring 
5 Harbor Press, Cold Spring Harbor, New York). An oligo-dT primed cDNA 
library of 5 x 10^ primary recombinants was constructed in lambda Uni-Zap 
(Stratagene) from 5 /ig of this mRNA using the Stratagene Uni-Zap cDNA 
cloning system. The construction of the total bovine brain cDN A library in 
lambda Uni-Zap has been described previously (Otsu a/ (1991) Cell 65, 91- 
10 104). 

Library Screening and Hybridisations: The unamplifred SGBAF-1 cDNA 
library (10* recombinants) was plated on E. coli K12 PLK-F (Stratagene) at a 
density of 10^ plaques per 15 cm dish, and lifts were taken in duplicate onto 

15 nitrocellulose membranes (Millipore) . For screening, filters were prehybridised 
for at least 1 hr at 42°C in 6 x SSPE, 0.5% SDS, 10 x Denhardt's solution, 
and 100 /xg/ml denatured sonicated herring sperm DNA (Sigma). Hybridisation 
was carried out in the same solution containing 10 ng/ml radiolabeled 
oligonucleotide. Oligonucleotides used were: peptide N, (MDWIFHT; SEQ 

20 ID No 8) 5'-AA(G/A)ATGGA(T/C)TGGAT(C/T/A)TT(T/C)CA(T/C)AC-3' 
(SEQ ID No 9); peptide J (DDGQLFHIDFGHF; SEQ ID No 10) 5'- 
GATGATGGCC-A(G/A)CTGTT(T/C)CA(T/C)AT(T/A)GA(T/C)- 
TTTGGCCA(T/C)TT (SEQ ID No 1 1). Oligonucleotides were labeled with ^2? 
at the 5' end in a 20 n\ reaction containing 100 ng of oligonucleotide, 1 x 

25 kinase buffer (Promega), 0. 1 mM spermidine, 5 mM dithiothreitol, 100 /xCi of 
[7-^-P]ATP (5000 Ci/mmol, Amersham), and 2 ^1 (20 U) of T4 polynucleotide 
kinase (Amersham). Filters were washed in 6 x SSC, 0.1% SDS at room 
temperature and then subjected to autoradiography using Kodak XAR film. 
Hybridising clones were plaque purified and rescued as plasmids according to 

30 the manufacturer's instructions. 
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Characterisation of cDNA Clones: Sequencing was carried out by the chain 
termination method using the Sequenase system (US Biochemicals). Clones for 
sequencing were obtained by directed cloning of restriction fragments into 
M13mpl8 and mpl9 vectors (Yanisch-Perron et al (1985) Gene 33, 103-119) 
5 and by making a series of exonuclease Ill-mediated deletions (Henikoff (1984) 
Gene 28, 351-359; Pharmacia Exonuclease HI deletion kit). DNA sequences 
were analysed on a Micro- VAX computer using the Wisconsin sequence 
analysis package (UWGCG; Devereux et al (1984) Nucl Acids Res. 12, 387- 
395). 

10 

RACE PCR: RACE PCR was carried out essentially as described previously 
(Frohman et al (1988) Proc. Natl Acad. Sci. USA 85, 8998-9002; Harvey and 
Garlison (1991) Nucl. Acids Res. 19, 4002). In brief, first-strand cDNA 
primed with random hexamers (Amersham) was synthesised from 1 ^g> of 

15 SGBAF-1 cell mRNA using the Stratagene first-strand cDNA synthesis kit. 
First-strand cDNA was isolated by isopropanol precipitation and tailed with 
oligo-(dA) using terminal deoxynucleotidyl transferase (Bethesda Research 
Laboratories). PCR was performed using oligo 2224 (5'- 
AATTCACACACTGGCATGCCGAT; SEQ ID No 12) and adaptor dT (5'- 

20 GACTCGAGTCGACATCGA l i 1 1 l lTl 1 1 11111 IT ; SEQ ID No 13) as 
primers, using a Perkin-EImer Cetus Taq polymerase PCR kit (conditions: 30 
cycles of 94'C for 1 min, 35 °C for 1 min, 72°C for 2 min). Products were 
fractionated on a 1 .5 % low melting point agarose gel and visualised by staining 
with ethidium bromide. The gel was sliced into six bands (ranging from 150 

25 bp to 2000 bp), and DNA was isolated from each gel slice. A further round 
of PCR was performed on this DNA using oligonucleotide 2280 (5'- 
TTTAAGCTTAGGCATTCTAAAGTCACTATCATCCC; SEQ ID No 14) and 
adaptor (5'-GACTCGAGTCGACATCGA; SEQ ID No 15) as primers 
(conditions: 35 cycles of 94''C for 1 min, 56*C for 1 min, 72*C for 2 min). 

30 Products were fractionated on an agarose gel and visualised by staining with 
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ethidium bromide. A band 250 bp shorter than the size of the DNA in the gel 
slice used for the PGR was expected. An intensely staining band of 350 bp 
obtained from the —600 bp gel slice was excised, digested with HindUl and 
Sail, and ligated into Bluescript KS- digested with HindUl and Xhol to give 
5 plasmid pBS/race. Two independent inserts were completely sequenced. The 
sequence of pi 10, the 110-kD catalytic subunit of PI3-kinase is shown in 
Figure 1 and has the GenEMBL Accession No M93259 (SEQ ID No 1), 

Isolation of nmt promoter. The promoter has been isolated by Maundrell (2) 
10 and may be isolated by repeating the procedures reported in that reference. 
Moreover, the sequence of the gene, including the promoter, has been 
submitted to the GenBank'^'^/EMBL database as Accession No J05493 and is 
shown in Figure 2 (SEQ ID No 3). 

15 Vectors containing the nmt promoter and derivatives of the nmt promoter 
suitable for use in the present invention are described by Basi et al (1992) Yeast 
8, S597 (special issue) and Maundrell (1990). 

The upstream regulatory region and downstream polyadenylation site of nmtl 
20 have been incorporated into two types of 5. pombe/E. coli shuttle vector: 
pREP extrachromosomally replicating plasmids and pRIP integrating plasmids. 
Using either of these constructs thiamine mediated transcriptional regulation can 
be transferred to heterologous coding sequences. 

25 The time course of induction and repression have been studied as a function of 
changes in the intracellular thiamine concentration. Addition of thiamine to 
cells growing in minimal medium results in a rapid rise in the internal thiamine 
from a basal level of around 10 pmoles/lO"' cells to up to 1000 fold this level 
and this is accompanied by repression of nmtl promoter activity. If cells are 

30 then washed and allowed to continue growth in minimal medium, the 
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intracellular thiamine is progressively diluted as the cell mass doubles and 
transcription is reinitiated as the internal thiamine concentration falls below 50 
pmoles/10^ cells. The time taken to re-activate the nmtl promoter therefore 
depends on the internal thiamine concentration at the time when the cells are 
5 transferred to thiamine free medium. 

Quantitation of promoter strength was assessed using chloramphenicol acetyl 
transferase as a reporter gene. The fiiUy induced nmtl promoter is about 6 fold 
more active than the S. pombe adh promoter and its activity is reduced about 

10 80 fold when cells are grown in repressing conditions. These vectors are 
ideally suited to applications requiring maximal expression of a gene of interest. 
In addition, two modified versions of the promoter with reduced activity have 
been created following an analysis of the effects of TATA box mutations. 
Truncating the wild type TATA box, TATATAAA to ATAAA (the '4' series) 

15 or AT (the *8' series) down-regulates transcriptional activity of the nmtl 
promoter by approximately 1 and 2 orders of magnitude respectively (see 
Table). These mutations in the TATA box do not affect thiamine repressibility 
or the site of transcription initiation. 

20 The table below summarises the salient features of some of the vectors which 
have been constructed thus far: 
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^ the Ball site is replaced with an Xhol site allowing expression from the 
ATG. 

^ in some of the vectors the complementation gene used for selection of 
plasmid uptake has been changed from the LEU2 gene to the sup 3.5 
5 gene which complements the Ade 6.704 mutation or to the URA4 gene. 

The backbone of the plasmid is not altered (ie promoter and stop 
sequence from the nmtl gene, ARSl and pUC119 backbone). 



Construction of an 5, pomhe pllO expression system. A suitable restriction 
10 fragment containing the complete 110 kDa subunit open reading frame and 
flanking sequences is subcloned into the nmt promoter plasmid containing a 
suitable marker gene for selection creating an nmr- 100 plasmid in order to 
allow expression of the 110 kDa protein under the control of the thiamine 
repressible nmt promoter. The nmt A 10 plasmid is grown in a suitable bacterial 
15 host and the plasmid purified by conventional techniques (Sambrook et al 
1989). A 3.4 kb BamHl/Fspl fragment containing the cDNA of pi 10 was 
isolated and subcloned into the BamHl/Smal sites of pREP3X-pl 10 (nrnt-l 10). 

The nmt-l 10 plasmid is then transfected by standard procedures (Moreno, Klar 
20 et al 1991) into a Schizosaccharomyces pombe strain that is auxotrophic for 
leucine cells are transformed using electroporation, Transfected cells are then 
plated in the presence of thiamine and in the absence of leucine. As an 
alternative Schizosaccharomyces pombe strains which are auxotrophic for 
adenine or uracil (that is Ade" or Ura ) may be used; in this case the cells are 
25 plated in the presence of thiamine and absence of adenine, or the presence of 
thiamine and absence of uracil, respectively. Colonies growing up under these 
conditions are then analysed for the presence of the nmt-WQ plasmid. The 
lethal phenotype caused by the expression of 1 1 0 kDa protein is checked by 
replating colonies in the presence or absence of thiamine; under the latter 
30 conditions colonies will arrest and/or die. 
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For the purposes of setting up a screen for inhibitors, a stable transformant is 
isolated. This is carried out by standard procedures involving growth in the 
presence and absence of the selectable marker leucine (or adenine or uracil). 
Isolates obtained in this manner are checked for the stable insertion of the 110 
5 kDa sequence into genomic DNA by Southern analysis or stable replication of 
a non-integrated plasmid. Expression of the pi 10 protein is also confirmed by 
western blot analysis of the transformants using antibodies reactive against 
pi 10, or by measuring the activity of the pi 10 subunit in the transformed cells. 
The inducible lethal phenotype is rechecked by growth of these isolates in the 
10 presence and absence of thiamine ( > 10 nM). 

It is preferred if 100 nM, or > 1 pM or > 1 /xM is used. 

It is most preferred if 15 /xM thiamine is used. 

15 

Operating the screen* The screen for inhibitor activity is carried out on a 96- 
well microtitre plate format. An integrant colony is picked and put into liquid 
culture in minimal medium, 2% glucose, 15 fiM thiamine and supplements 
appropriate for the strain (eg uracil 50 jug/ml would be included for a ura' 

20 strain if the integrated plasmid did not harbour a URA4-based selection 
marker). This culture is grown up and, after extensive washing, used to seed 
two 10 ml cultures, one containing thiamine as above, and one without. The 
cultures are expanded overnight and then diluted to an optical density (OD) at 
595 nm of 0.01-0.10. For those cells requiring treatment for arrest of growth 

25 additions are made at this stage prior to plating. The diluted cultures are then 
aliquoted into wells of a sterile 96 well microtitre plate containing individual 
test compounds in the presence or absence of thiamine. The growth of the cells 
is monitored over time until the OD595 reached is —0.8 for control cultures. 
Control cultures are those cultured with thiamine. The OD595 is assessed using 

30 a microtitre plate reader. 
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The cells precultured in thiamine and retained in thiamine serve to indicate 
optimum growth rate. Cells precultured in the absence of thiamine and then 
put into wells containing thiamine provide a control for the rescue of growth. 
Cells precultured in the absence of thiamine and put into wells in the absence 
5 of thiamine or test compound provide a baseline for non-growth. Individual 
test compounds are assessed for their potency in permitting growth in the 
absence of thiamine in cells plated in the absence of thiamine. 

Accumulated experience in the operation of this screen for a particular gene 
10 product permits a less frequent monitoring of the growth curves and a single 
time point may be found to be sufficient. Similarly, cultures propagated 
throughout in the presence of thiamine may be found to be a non-essential 
control. These alterations to the procedure may provide some practical 
advantages in increasing the number of test compounds per 96 well plate and 
15 in reducing the time required for assessment of growth. 

The above procedures have been employed in creating an S. pombe strain 
harbouring a pi 10 cDNA under the control of the nmr promoter. Switching 
these cells from a medium containing thiamine (15 ptM) to one in the absence 
20 of thiamine causes growth arrest. Evidence that the arrest is a consequence of 
the expression of the mammalian protein has come from a number of 
observations: 

1 . Transient transfection and subsequent expression has been observed on 
25 multiple occasions with the pi 10 cDNA and not with the vector alone. 

2. On expression of the pi 10 protein, it is possible to detect the activity 
of the expressed mammalian protein in cell extracts, ie the catalytic 
activity is retained on expression in 5. pombe. 

30 



wo 94/03609 



PCr/GB93/01651 



22 

3 . On expression of the mammalian regulatory subunit of the kinase, p85a 
[4], increased expression of pllO no longer induces growth arrest. 

The use of this system as a viable tool for screening pllO inhibitors is 
5 evidenced by the ability of pSSa, the regulatory subunit, to suppress the growth 
arrest phenotype. Biochemical evidence has already established that the p85a- 
pl 10 complex is less active than the free pi 10 protein [9]. 

The lethal effect of pi 10 expression in S. pombe is suppressed by p85 
10 expression as shown in Figure 7. Stable pi 10-expressing S. pombe cells were 
transformed with the pREP4 vector, or the pREP4-p85a or pREP4-p85^ 
constructs and, after selection for plasmid uptake, were streaked onto selective 
minimal medium plates in the presence or absence of thiamine. Expression of 
pUO alone is lethal but this effect is rescued by co-expression of either p85a 
15 orp85/S. 

The pSSor and p85/3 cDNAs can be obtained using the methods described by 
Otsu et al (1991) Cell 65, 91-104 incorporated herein by reference. 

20 Example 2: Isotvpe-specific effects of PKC expression in S. pomb e and the 
effect of PKC expression on growth rates in liquid culture 

S. pombe strains containing integrated plasmids for expression of mammalian 
PKC-7, -5, -e, -f or -t} were streaked onto selective minimal medium plates in 

25 the absence of thiamine or the presence of thiamine or TPA as shown in Figure 
8. Growth of control (vector) or PKC-f cells was similar under all three 
conditions. PKC-7 expression (Figure 8, plate B) marginally decreased growth 
and TPA addition to these cells totally supressed growth (Figure 8, plate C). 
PKC-6, -€ and -17 expression alone was markedly growth inhibitory (Figure 8, 

30 plate B). 
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Stable PKC-iS. pombe strains were cultured in minimal medium in the absence 
of thiamine for 18 hours until an OD^^^ of 0.2-0.5 was attained (see Figure 9). 
Strains were then (at time zero) diluted to an OD^^^ of 0.02 in minimal medium 
and cultured in the presence of 1 pM thiamine (controls) (a), in the absence 
5 of thiamine (■) or in the absence of thiamine with 100 ng/ml TPA (O). At 
the indicated times, the cell density was calculated by measuring the OD^'^. 
PKC-f cells grew at a rate essentially indistinguishable from vector controls. 
PKC-5, -6 and -17 expression markedly delayed growth when compared with 
vector controls (-thiamine). Growth of PKC-y, -5 and -7; expressing cells was 
10 essentially nil when cultured in the presence of TPA. 

Example 3: An inhibitor screen for protein kinase C-g 

Protein kinase C-e [10] cDNA (Figure 3; SEQ ID No 4) has been introduced ^ 
15 into a plasmid under the control of the runt promoter yielding n/nr-PKC-e, A 
2.7 kb Xhol fragment with the fiill coding sequence for PKC-e was isolated 
from pMT2-PKC-€ and subcloned into 5a/I-digested pREP3X. Then 300 bp of 
5' non-coding sequence was removed by digesting with Xhol and Ncol, blunting 
the ends and religating to give pREP3X-PKC-€. The plasmid pMT2-PKC-e can 
20 be prepared by the methods described by Schaap et al (1989) FEES Lett. 243, 
351-357. Transfection of this construct into 5. pombe employing selection for 
uptake of the LEU2 gene in the presence of thiamine, yields populations of 
cells that on switching to "no thiamine'* conditions while retaining selection for 
LEU2, reduce growth rate. 

25 

Growth inhibition is consistent with the expression of the mammalian PKC-e 
gene product since: 

1 . Growth inhibition correlates with an induction of the PKC-6 protein as 
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judged by Western analysis. 

2, The induced phenotype also correlates with expression of PKC-e 
activity as determined in cell extracts. 

5 

3 . Suppression of PKC-e expression by exposure to the phorbol ester TPA 
can rescue cells that are expressing low levels of PKC-€ (cells 
expressing high levels of PKC-e are not rescued and the steady state 
level of PKC-e is not significantly depressed by TPA treatment). 

10 

The expression of a functional PKC-e activity in S. pombe and its correlation 
with growth arrest under various growth conditions provides the basis for an 
inhibitor screen. 

15 The transformed cells are plated in the presence of thiamine (control) and the 
absence of thiamine (test) and the compound to be assayed is added to the 
"test" plates. 

Example 4: An inhibitor screen for protein kinase €-7 . 

20 

A cDNA for PKC-7 (Figure 4; SEQ ID No 5) has been introduced into a 
plasmid under the control of the nmt promoter, producing n/nr-PKC-7. A 2.4 
kb fiamHI/blunt ///ndlll fragment with the full coding sequence of PKC-7 was 
isolated from pSP64-PKC-7 and subcloned into the BantHl/Smal sites of 

25 pREP3X to give pREP3X-PKC-7. The plasmid pSP64-PKC-7 can be prepared 
as described by Patel & Stabel (1989) Cell SignalL 1, 227-240.Transfection 
of S. pombe with nmr-PKC-7 yields populations of cells that on switching to 
medium without thiamine induce PKC-7 protein as determined by Western 
blotting and the detection of PKC activity in cell extracts. These cells continue 

30 to grow on induction but if the PKC-7 is selectively activated by inclusion of 
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the phorbol ester TPA in the growth medium, the cells will arrest. The 
dependence of growth arrest upon the inclusion of TPA provides direct 
evidence that the catalytic function of PKC-7 is responsible for the phenotype. 
No such arrest is observed on treatment of the original S. pombe strain. Other 
5 PKC activators, such as Mezerein, or other phorbol esters or diacylglycerols 
may be used in place of TPA. 

That activation of PKC-7 induces growth arrest provides a screen for inhibition 
of function of this mammalian gene product. 

10 

Operating the screen. The screen for inhibitor activities is carried out on a 
96-well microtitre plate format. For thiamine repressible genes, stable 
integrants are grown up overnight (12 h) in the absence of thiamine. The 
culture is then diluted in the absence of thiamine to an OD595 = 0.01 to 0.10. 

15 The culture is then aliquoted into microtitre wells containing the potential 
inhibitors and, in the case of PKC-7, also phorbol ester. The growth of cells 
monitored at 595 nm using a microtitre plate reader. Cells are allowed to grow 
until parallel wells/plates containing cells growing in the presence of thiamine 
(15 /iM) have increased their OD595 to 1 ,0 units. Cells from the test wells that 

20 have proliferated can be scored relative to both control wells (eg + thiamine) 
and no addition wells (-inhibitor, -thiamine). 

Thus, for PKC-7 there are the following possibilities: (i) control plates which 
are + thiamine or -thiamine or -thiamine + TPA and (ii) test plates which are 
25 + thiamine + compound or -thiamine + compound or -thiamine + TPA + 
compound. 
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Vvatn plP 5: An inhibitor ser pen for protein kinase C-5. 

A cDNA for PKC-5 (Figure 5; SEQ ID No 6) has been introduced into a 
plasmid under the control of the nmt promoter, producing nmr-PKC-5. A 2.4 
5 kb blunt PfMllNdel fragment containing the full coding sequence of PKC-6 
was isolated from pBluescript-PKC-6 and subcloned into blunt Safl-digested 
pREP3X to give pREP3X-PKC-6. The plasmid pBluescript-PKC-S can be 
obtained using the methods described in Olivier & Parker (1991) Eur. J. 
Biochem. 200, 805-810 incorporated herein by reference. Transfection of 5. 
10 pombe with nmf-PKC-6 yields populations of cells that on switching to medium 
without thiamine induce PKC-6 protein as determined by Western blotting and 
by activity measurements. There is marked growth inhibition by expression 
alone and if the PKC-5 is activated by inclusion of the phorbol ester TPA in the 
growth medium, the phenotype is strengthened. Experiments with PKC-5 also 
15 provide firm evidence that the phenotype is a result of the ftmction of the 
kinase. Part of the kinase domain of PKC-5 was deleted thus rendering it 
enzymatically inactive. The product was expressed to a high level in 5. pombe 
but there was no growth inhibition thus indicating that the phenotype is due to 
the functional kinase. 



20 



That activation of PKC-5 induces growth inhibition provides a screen for 
inhibition of function of this mammalian gene product. 

Operating the screen. The screen for inhibitor activities is carried out on a 
25 96-weU microtitre plate format. For thiamine repressible genes, stable 
integrants are grown up overnight (12 h) in the absence of thiamine. The 
culture is then diluted in the absence of thiamine to an OD595 = 0.0 1 to 0.10. 
The culture is then aliquoted into microtitre wells containing the potential 
inhibitors and, in the case of PKC-7, also phorbol ester. The growth of cells 
30 monitored at 595 nm using a microtitre plate reader. Cells are allowed to grow 
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until parallel wells/plates containing cells growing in the presence of thiamine 
(15 ^tM) have increased their OD595 to 1.0 units. Cells from the test wells that 
have proliferated can be scored relative to both control wells (+ thiamine) and 
no addition wells (-inhibitor, -thiamine). Additionally, the test wells may 
5 contain or lack TPA. 

Figure 10 shows that the PKC-5-induced growth inhibition is the result of 
kinase activity. S. pombe cells were transformed with a control vector or 
vectors to express the fiiU length PKC-5 protein or a PKC-5 protein in which 

10 part of the catalytic domain has been deleted to render it functionally inactive 
as a protein kinase (PKC-5A). After selection for uptake of plasmid, a number 
of colonies were plated onto selective medium plates in the presence of 
thiamine, the absence of thiamine or the presence of TPA. PKC-5 expression 
markedly inhibits growth (-thiamine plate) and addition of TPA increases the 

15 effect. In contrast, expression of PKC-5A has no effect on growth under any 
condition. 

Example 6: An inhibitor screen for protein kinase C-iy . 

20 A cDNA for PKC-t; (Figure 6; SEQ ID No 7) has been introduced into a 
plasmid under the control of the nmt promoter, producing /imr-PKC-ij. A 3,3 
kb Xhol fragment containing the coding sequence for PKC-t/ was isolated from 
pBluescript-PKC-77 and subcloned into 5a/I-digested pREP3X to give pREP3X- 
PKC-t;. The plasmid pBluescript-PKC-?; can be obtained using the methods 

25 described by Dekker et al (1992) FEES Lett, 312, 195-199. Transfection of 5, 
pombe with nmt-PYJZ-r} yields populations of cells that on switching to medium 
without thiamine induce PKC-t; protein as determined by Western blotting and 
the detection of PKC activity in cell extracts. However, there is some 
expression even in the presence of thiamine which produces —50% growth 

30 inhibition. There is an even more marked growth inhibition by derepressed 
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expression alone and if the PKC-77 is selectively activated by inclusion of the 
phorbol ester TPA in the growth medium, there is no growth. 

That activation of PKC-i; induces growth inhibition provides a screen for 
5 inhibition of function of this mammalian gene product. 

Operating the screen. The screen for inhibitor activities is carried out on a 
96-well microtitre plate format. For thiamine repressible genes, stable 
integrants are grown up overnight (12 h) in the absence of thiamine. The 
10 culture is then diluted in the absence of thiamine to an OD595 = 0.01 to 0.10. 
The culture is then aliquoted into microtitre wells containing the potential 
inhibitors and, in the case of PKC-7, also phorbol ester. The growth of cells 
monitored at 595 nm using a microtitre plate reader. Cells are allowed to grow 
until parallel wells/plates containing cells growing in the presence of thiamine 
15 (15 ixM) have increased their OD595 to 1 .0 units. Cells from the test wells that 
have proliferated can be scored relative to both control wells (+ thiamine) and 
no addition wells (-inhibitor, -thiamine). Additionally, the test wells may 
contain or lack TPA. 

20 



BNSDOCID: <WO 9403609A1 J_> 



wo 94/03609 PCr/GB93/01651 

29 

REFERENCES 

1. Russell, P. & Nurse, P. (1986) Cell 45, 145-153. 
5 2. Maundrell, K. (1990) J. Biol. Chem. 265, 10857-10864. 

3. Morgan, S.J. et al (1990) Eur. J. Biochem. 191, 761-767. 

4. Otsu, M. et al (1991) Cell 65, 91-104. 

10 

5. Laemmli, U.K. (1970) Nature 221, 680-685. 

6. Katan, M. et al (1988) Cell 54, 171-177. 

15 7. Sambrook, J. et al (1989) "Molecular Cloning, a Laboratory Manual" 
C.S.H. Laboratory Press. 

8. Moreno, S. et al (1991) "Molecular Genetic Analysis of fission yeast 
Schizosaccharomyces pombe ". In Methods in Enzymology. pp 795- 

20 823. 

9. Shibasaki, F. et al (1991) J. Biol. Chem. 266, 8108-8114. 

10. Schaap, D. et al (1989) FEBS Lett. 243, 351-357. 

25 



BNSOOCID: <WO_9403609A1J_> 



wo 94/03609 



PCT/GB93/01651 



30 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Imperial Cancer Research Technology Ltd 

(B) STREET: Sardinia House, Sardinia Street 

(C) CITY: London 

(E) COUNTRY: United Kingdom 

<F) POSTAL CODE (ZIP): WC2A 3NL 

(G) TELEPHONE: 071 242 1136 

(H) TELEFAX: 071 831 4991 

(ii) TITLE OF INVENTION: Transformed cells and assays using them 
(iii) NUMBER OF SEQUENCES: 15 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1*0, Version #1,25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 3498 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3204 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG CCT CCA AGA CCA TCA TCA 
Met Pro Pro Arg Pro Ser Ser 
1 5 

CCC CCA AGA ATC CTA GTA GAA 
Pro Pro Arg lie Leu Val Glu 
20 

ACT TTA GAA TGC CTC CGT GAG 
Thr Leu Glu Cys Leu Arg Glu 
35 

CTA TTT AAA GAA GCA AGA AAA 
Leu Phe Lys Glu Ala Arg Lys 

50 .55 

GAA TCT TCT TAC ATT TTC GTA 
Glu Ser Ser Tyr lie Phe Val 



GGT GPiPi CTG TGG GGC ATC CAC TTG ATG 48 
Gly Glu Leu Trp. Gly lie His Leu Met 
10 15 

TGT TTA CTA CCA AAT GGG ATG ATA GTG 96 
Cys Leu Leu Pro Asn Gly Met lie Val 
25 30 

GCT ACG TTA ATA ACG ATA AAG CAT GAA 144 
Ala Thr Leu lie Thr lie Lys His Glu 
40 45 

TAC CCT CTC CAT CAA CTT CTT CAA GAT 192 
Tyr Pro Leu His Gin Leu Leu Gin Asp 
60 

AGT GTT ACC CAA GAA GCA GAA AGG GAA 240 
Ser Val Thr Gin Glu Ala Glu Arg Glu 
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65 70 75 80 

GAA TTT TTT GAT GAA ACA AGA CGA CTT TGT GAG CTT CGG CTT TTT CAA 288 
Glu Phe Phe Asp Glu Thr Arg Arg Leu Cys Asp Leu Arg Leu Phe Gin 
85 90 95 

CCC TTT TTA AAA GTA ATT GAA CCA GTA GGC AAC CGT GAA GAA AAG ATC 336 
Pro Phe Leu Lys Val lie Glu Pro Val Gly Asn Arg Glu Glu Lys lie 
100 105 110 

CTC AAT CGA GAA ATT GGT TTT GCT ATC GGC ATG CCA GTG TGT GAA TTC 384 
Leu Asn Arg Glu lie Gly Phe Ala lie Gly Met Pro Val Cys Glu Phe 
lis 120 125 

GAT ATG GTT AAA GAT CCA GAA GTA CAG GAC TTC CGA AGA AAT ATT CTC 432 
Asp Met Val Lys Asp Pro Glu Val Gin Asp Phe Arg Arg Asn lie Leu 
130 135 140 

AAT GTT TGT AAA GAA GCT GTG GAT CTT AGG GAT CTT AAT TCA CCT CAT 480 
Asn Val Cys Lys Glu Ala Val Asp Leu Arg Asp Leu Asn Ser Pro His 
145 150 155 160 

AGT AGA GCA ATG TAT GTT TAT CCT CCA AAT GTA GAA TCT TCA CCA GAA 528 
Ser Arg Ala Met Tyr Val. Tyr Pro Pro Asn Val Glu Ser Ser Pro Glu 
165 170 175 

CTG CCA AAG CAC ATA TAT AAT AAA TTG GAT AAA GGG CAA ATA ATA GTG 576 
Leu Pro Lys His lie Tyr Asn Lys Leu Asp Lys Gly Gin lie lie Val 
180 185 190 

GTG ATT TGG GTA ATA GTT TCT CCA AAT AAT GAC AAA CAG AAG TAT ACT 624 
Val lie Trp Val lie Val Ser Pro Asn Asn Asp Lys Gin Lys Tyr Thr 
195 200 205 

CTG AAA ATC AAC CAT GAC TGT GTG CCA GAA CAA GTA ATT GCT GAA GCA 672' 
Leu Lys lie Asn His Asp Cys Val Pro Glu Gin Val He Ala Glu Ala 
210 215 220 

ATC AGG AAA AAA ACT CGA AGT ATG TTG CTA . TCA TCT GAA CAA CTA AAA 720 
He Arg Lys Lys Thr Arg Ser Met Leu Leu Ser Ser Glu Gin Leu Lys 
225 230 235 240 

CTC TGT GTT TTA GAA TAT CAG GGC AAG TAT ATT TTA AAA GTG TGT GGA 768 
Leu Cys Val Leu Glu Tyr Gin Gly Lys Tyr He Leu Lys Val Cys Gly 
245 250 255 

TGT GAT GAA TAG TTC CTA GAA AAA TAT CCT CTG AGT CAG TAT AAG TAT 816 
Cys Asp Glu Tyr Phe Leu Glu Lys Tyr Pro Leu Ser Gin Tyr Lys Tyr 
260 265 270 

ATA AGA AGC TGT ATA ATG CTT GGG AGG ATG CCC AAT TTG ATG CTG ATG 864 
He Arg Ser Cys He Met Leu Gly Arg Met Pro Asn Leu Met Leu Met 
275 280 285 

GCT AAA GAA AGC CTC TAT TCT CAA CTG CCA ATG GAC TGT TTT ACA ATG 912 
Ala Lys Glu Ser Leu Tyr Ser Gin Leu Pro Met Asp Cys Phe Thr Met 
290 295 300 

CCA TCA TAT TCC AGA CGC ATC TCC ACA GCT ACG CCA TAT ATG AAT GGA 960 
Pro Ser Tyr Ser Arg Arg He Ser Thr Ala Thr Pro Tyr Met Asn Gly 
305 310 315 320 

GAA ACA TCT ACA AAA TCC CTT TGG GTT ATA AAT AGT GCA CTC AGA ATA 1008 
Glu Thr Ser Thr Lys Ser Leu Trp Val He Asn Ser Ala Leu Arg He 
325 330 335 
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AAA ATT CTT TGT GCA ACC TAT GTG AAT GTA AAT ATT CGA GAG ATT GAG 1056 

Lys lie Leu Cys Ala Thr Tyr Val Asn Val Asn He Arg Asp lie Asp 
340 345 350 

AAG ATT TAT GTT CGA ACA GGT ATC TAG CAT GGA GGA GAA CCC TTA TGT 1104 
Lys He Tyr Val Arg Thr Gly He Tyr His Gly Gly Glu Pro Leu Cys 
355 360 365 

GAT AAT GTG AAC ACT CAA AG A GTA CCT TGT TCC AAT CCC AGG TGG AAT 1152 
Asp Asn Val Asn Thr Gin Arg Val Pro Cys Ser Asn Pro Arg Trp Asn 
370 375 380 

GAA TGG CTG AAT TAC GAT ATA TAG ATT CCT GAT CTT CCT CGT GCT GCT 1200 
Glu Trp Leu Asn Tyr Asp He Tyr He Pro Asp Leu Pro Arg Ala Ala 
385 390 395 400 

CGA CTT TGC CTT TCC ATT TGT TCT GTT AAA GGC CGA AAG GGT GCT AAA 1248 
Arg Leu Cys Leu Ser He Cys Ser Val Lys Gly Arg Lys Gly Ala Lys 
405 410 415 

GAG GAA CAC TGT CCA TTG GCC TGG GGA AAT ATA AAC TTG TTT GAT TAC 1296 
Glu Glu His Cys Pro Leu Ala Trp Gly Asn He Asn Leu Phe Asp Tyr 
420 425 430 

ACA GAT ACT CTA GTA TCT GGA AAA ATG GCT TTG AAT CTT TGG CCA GTA 1344 
Thr Asp Thr Leu Val Ser Gly Lys Met Ala Leu Asn Leu Trp Pro Val 
435 440 445 

CCT CAT GGA CTA GAA GAT TTG CTG AAC CCT ATT GGT GTT ACT GGA TCA 1392 
Pro His Gly Leu Glu Asp Leu Leu Asn Pro He Gly Val Thr Gly Ser 
450 455 460 

AAT CCA AAT AAA GAA ACT CCA TGT TTA GAG TTG GAG TTT GAC TGG TTC 1440 
Asn Pro Asn Lys Glu Thr Pro Cys Leu Glu Leu Glu Phe Asp Trp Phe 
465 470 475 480 

AGC AGT GTG GTA AAG TTT CCA GAT ATG TCA GTG ATT GAA GAG CAT GCC 1488 
Ser Ser Val Val Lys Phe Pro Asp Met Ser Val He Glu Glu His Ala 
485 490 495 

AAT TGG TCT GTA TCC CGT GAA GCA GGA TTT AGT TAT TCC CAT GCA GGA 1536 
Asn Trp Ser Val Ser Arg Glu Ala Gly Phe Ser Tyr Ser His Ala Gly 
500 505 510 

CTG AGT AAC AGA CTA GCT AGA GAC AAT GAA TTA AGA GAA AAT GAT AAA 1584 
Leu Ser Asn Arg Leu Ala Arg Asp Asn Glu Leu Arg Glu Asn Asp Lys 
515 520 525 

GAA CAG CTC CGA GCA ATT TGT ACA CGA GAT CCT CTA TCT GAA ATC ACT 1632 
Glu Gin Leu Arg Ala He Cys Thr Arg Asp Pro Leu Ser Glu He Thr 
530 535 540 

GAG CAA GAG AAA GAT TTT CTG TGG AGC CAC AGA CAC TAT TGT GTA ACT 1680 
Glu Gin Glu Lys Asp Phe Leu Trp Ser His Arg His Tyr Cys Val Thr 
545 550 555 560 

ATC CCC GAA ATT CTA CCC AAA TTG CTT CTG TCT GTT AAA TGG AAC TCT 1728 
He Pro Glu He Leu Pro Lys Leu Leu Leu Ser Val Lys Trp Asn Ser 
565 570 575 

AGA GAT GAA GTA GCT CAG ATG TAC TGC TTG GTA AAA GAT TGG CCT CCA 1776 
Arg Asp Glu Val Ala Gin Met Tyr Cys Leu Val Lys Asp Trp Pro Pro 
580 585 590 

ATC AAG CCT GAA CAG GCT ATG GAG CTT CTG GAC TGC AAT TAC CCA GAT 1824 
He Lys Pro Glu Gin Ala Met Glu Leu Leu Asp Cys Asn Tyr Pro Asp 
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595 600 60S 

CCT ATG GTT CGA GGT TTT GOT GTT CGG TGC TTA GAA AAA TAT TTA ACA 1872 
Pro Met Val Arg Gly Phe Ala Val Arg Cys Leu Glu Lys Tyr Leu Thr 
610 615 620 

GAT GAC AAA CTT TCT CAG TAG CTA ATT GAG CTA GTA GAG GTA CTA AAA 1920 
Asp Asp Lys Leu Ser Gin Tyr Leu lie Gin Leu Val Gin Val Leu Lys 
625 630 635 640 

TAT GAA CAG TAT TTG GAT AAC CTG CTT GTG AG A TTT TTA CTC AAA AAA 1968 
Tyr Glu Gin Tyr Leu Asp Asn Leu Leu Val Arg Phe Leu Leu Lys Lys 
645 650 655 

GCG TTA ACT AAT CAA AGG ATC GGT CAC TTT TTC TTT TGG CAT TTA AAA 2016 
Ala Leu Thr Asn Gin Arg lie Gly His Phe Phe Phe Trp His Leu Lys 
660 665 670 

TCT GAG ATG CAC AAT AAA ACA GTT AGT CAG AGG TTT GGC CTG CTT TTG 2064 
Ser Glu Met His Asn Lys Thr Val Ser Gin Arg Phe Gly Leu Leu Leu 
675 680 685 

GAG TCC TAT TGC CGT GCA TGT GGG ATG TAT CTG AAG CAC CTT AAT AGG 2112 
Glu Ser Tyr Cys Arg Ala Cys Gly Met Tyr Leu Lys His Leu Asn Arg 
690 695 700 

CAA GTT GAG GCT ATG GAA AAG CTC ATT AAC TTG ACT GAC ATT CTC AAA 2160 
Gin Val Glu Ala Met Glu Lys Leu lie Asn Leu Thr Asp He Leu Lys 
705 710 715 720 

CAA GAG AAG AAG GAT GAA ACA CAA AAG GTA CAG ATG AAG TTT TTA GTT 2208 
Gin Glu Lys Lys Asp Glu Thr Gin Lys Val Gin Met Lys Phe Leu Val 
725 730 735 

GAG CAA ATG CGG CGA CCA GAT TTC ATG GAT GCT CTC CAG GGC TTT CTG 2256 
Glu Gin Met Arg Arg Pro Asp Phe Met Asp Ala Leu Gin Gly Phe Leu ' 
740 745 750 

TCT CCT CTA AAC CCT GCT CAT CAG CTG GGA AAT CTC AGG CTT GAA GAG 2304 
Ser Pro Leu Asn Pro Ala His Gin Leu Gly Asn Leu Arg Leu Glu Glu 
755 760 765 

TGT CGA ATT ATG TCT TCT GCA AAA AGG CCA CTG TGG TTG AAT TGG GAG 2352 
Cys Arg He Met Ser Ser Ala Lys Arg Pro Leu Trp Leu Asn Trp Glu 
770 775 780 

AAC CCA GAC ATC ATG TCA GAA TTA CAC TTT CAG AAC AAT GAG ATC ATC 2400 
Asn Pro Asp He Met Ser Glu Leu His Phe Gin Asn Asn Glu He He 
785 790 795 800 

TTT AAA AAT GGG GAT GAT TTA CGG CAA GAT ATG CTA ACC CTT CAG ATT 2448 
Phe Lys Asn Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin He 
805 810 815 

ATT CGC ATT ATG GAA AAT ATC TGG CAA AAT CAA GGT CTT GAT CTT CGA 2496 
He Arg He Met Glu Asn He Trp Gin Asn Gin Gly Leu Asp Leu Arg 
820 825 830 

ATG TTA CCT TAT GGA TGT CTG TCA ATC GGT GAC TGT GIG GGA CTT ATC 2544 
Met Leu Pro Tyr Gly Cys Leu Ser He Gly Asp Cys Val Gly Leu He 
835 840 845 

GAG GTG GTG AG A AAT TCT CAC ACT ATA ATG CAG ATT CAG TGT AAA GGA 2592 
Glu Val Val Arg Asn Ser His Thr He Met Gin He Gin Cys Lys Gly 
850 855 860 
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GGC CTG AAA GOT GCA CTG CAG TTT AAC AGC CAC ACA CTC CAT CAG TGG 2640 
Gly Leu Lys Gly Ala Leu Gin Phe Asn Ser His Thr Leu His Gin Trp 
865 870 875 880 

CTC AAA GAC AAG AAC AAG GGG GAA ATA TAT GAT GCG GCC ATC GAT TTG 2688 
Leu Lys Asp Lys Asn Lys Gly Glu lie Tyr Asp Ala Ala lie Asp Leu 
885 890 895 

TTT ACA CGA TCA TGT GCT GGA TAT TGT GTT GCC ACC TTC ATT TTG GGA 2736 
Phe Thr Arg Ser Cys Ala Gly Tyr Cys Val Ala Thr Phe lie Leu Gly 
900 905 910 

ATT GGA GAT CGT CAC AAT AGT AAT ATC ATG GTT AAA GAT GAT GGA CAA 2784 
lie Gly Asp Arg His Asn Ser Asn lie Met Val Lys Asp Asp Gly Gin 
915 920 925 

CTG TTT CAT ATA GAT TTT GGA CAC TTT TTG GAT CAC AAG AAG AAA AAA 2832 
Leu Phe His lie Asp Phe Gly His Phe Leu Asp His Lys Lys Lys Lys 
930 935 940 

TTT GGT TAT AAA CGA GAG CGC GTG CCG TTT GTT TTG ACA CAA GAT TTC 2880 
Phe Gly Tyr Lys Arg Glu Arg Val Pro Phe Val Leu Thr Gin Asp Phe 
945 950 955 960 

TTA ATA GTG ATT AGT AAA GGA GCC CAA GAA TGC ACA AAG ACA AGA GAA 2928 
Leu lie Val lie Ser Lys Gly Ala Gin Glu Cys Thr Lys Thr Arg Glu 
965 970 975 

TTT GAG AGG TTT CAG GAG ATG TGT TAC MiG GCT TAT CTA GCT ATT CGG 2976 
Phe Glu Arg Phe Gin Glu Met Cys Tyr Lys Ala Tyr Leu Ala lie Arg 
980 985 990 

CAG CAT GCC AAT CTC TTC ATA AAT CTT TTC TCA ATG ATG CTT GGC TCT 3024 
Gin His Ala Asn Leu Phe lie Asn Leu Phe Ser Met Met Leu Gly Ser 
995 1000 1005 

GGA ATG CCA GAA CTG CAA TCT TTT GAT GAT ATT GCA TAC ATT CGA AAG 3072 
Gly Met Pro Glu Leu Gin Ser Phe Asp Asp lie Ala Tyr lie Arg Lys 
1010 1015 1020 

ACC CTA GCT TTA GAT AAA ACT GAG CAA GAG GCT TTG GAG TAT TTC ATG 3120 
Thr Leu Ala Leu Asp Lys Thr Glu Gin Glu Ala Leu Glu Tyr Phe Met 
1025 1030 1035 1040 

AAA CAA ATG AAT GAT GCA CAC CAT GGT GGC TGG ACA ACA AAA ATG GAT 3168 
Lys Gin Met Asn Asp Ala His His Gly Gly Trp Thr Thr Lys Met Asp 
1045 1050 1055 

TGG ATC TTC CAC ACA ATT AAG CAG CAT GCT TTG AAC TGAAATGATA 3214 
Trp lie Phe His Thr lie Lys Gin His Ala Leu Asn 
1060 1065 



ACTAAAAGCT 


CAGTATCTGG 


ATTCTACACT 


GCACTGTTAA 


TAACTGTCAA 


CAGGCAAAGA 


3274 


CTGATTGCAT 


AGGAATTGCA 


CTUVTCCATGA 


ACAGCATTAG 


AATTTACAGC 


AAGAACAGAA 


3334 


ATAAAATACT 


ATATAATTTA 


AATAATGTAA 


ACGCAAACAG 


GGTTTGATAG 


CACTAAACTA 


3394 


GTTCATTTCA 


AAATTAAGCT 


TTAGAATAAT 


GCGCAATTTC 


ATGTTATGCC 


TTAAGTCCAA 


3454 


AAAGGTAAAC 


TTTAAAGATT 


GTTTGTATCT 


TTCCTTTAAA 


AAAA 




3498 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1068 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Pro Arg Pro Ser Ser Gly Glu Leu Trp Gly lie His Leu Met 
15 10 15 

Pro Pro Arg lie Leu Val Glu Cys Leu Leu Pro Asn Gly Met lie Val 
20 25 30 

Thr Leu Glu Cys Leu Arg Glu Ala Thr Leu lie Thr lie Lys His Glu 
35 40 45 

Leu Phe Lys Glu Ala Arg Lys Tyr Pro Leu His Gin Leu Leu Gin Asp 
50 55 60 

Giu Ser Ser Tyr lie Phe Val Ser Val Thr Gin Glu Ala Glu Arg Glu 
65 70 75 80 

Glu Phe Phe Asp Glu Thr Arg Arg Leu Cys Asp Leu Arg Leu Phe Gin 
85 90 95 

Pro Phe Leu Lys Val lie Glu Pro Val Gly Asn Arg Glu Glu Lys lie 
100 105 110 

Leu Asn Arg. Glu lie Gly Phe Ala lie Gly Met Pro Val Cys Glu Phe 
115 120 125 

Asp Met Val Lys Asp Pro Glu Val Gin Asp Phe Arg Arg Asn lie Leu 
130 135 140 

Asn Val Cys Lys Glu Ala Val Asp Leu Arg Asp Leu Asn Ser Pro His 
145 150 155 160 

Ser Arg Ala Met Tyr Val Tyr Pro Pro Asn Val Glu Ser Ser Pro Glu 
165 170 175 

Leu Pro Lys His He Tyr Asn Lys Leu Asp Lys Gly Gin He He Val 
180 185 190 

Val He Trp Val He Val Ser Pro Asn Asn Asp Lys Gin Lys Tyr Thr 
195 200 205 

Leu Lys He Asn His Asp Cys Val Pro Glu Gin Val He Ala Glu Ala 
210 215 220 

He Arg Lys Lys Thr Arg Ser Met Leu Leu Ser Ser Glu Gin Leu Lys 
225 230 235 240 

Leu Cys Val Leu Glu Tyr Gin Gly Lys Tyr He Leu Lys Val Cys Gly 
245 250 255 

Cys Asp Glu Tyr Phe Leu Glu Lys Tyr Pro Leu Ser Gin Tyr Lys Tyr 
260 - 265 270 

He Arg Ser Cys He Met Leu Gly Arg Met Pro Asn Leu Met Leu Met 
275 280 285 

Ala Lys Glu Ser Leu Tyr Ser Gin Leu Pro M t Asp Cys Phe Thr Met 
290 295 300 

Pro Ser Tyr Ser Arg Arg He Ser Thr Ala Thr Pro Tyr Met Asn Gly 
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305 310 315 320 

Glu Thr Ser Thr Lys Ser Leu Trp Val lie Asn Ser Ala Leu Arg lie 
325 330 335 

Lys lie Leu Cys Ala Thr Tyr Val Asn Val Asn lie Arg Asp lie Asp 
340 345 350 

Lys lie Tyr Val Arg Thr Gly lie Tyr His Gly Gly Glu Pro Leu Cys 
355 360 365 

Asp Asn Val Asn Thr Gin Arg Val Pro Cys Ser Asn Pro Arg Trp Asn 
370 375 380 

Glu Trp Leu Asn Tyr Asp He Tyr lie Pro Asp Leu Pro Arg Ala Ala 
385 390 395 400 

Arg Leu Cys Leu Ser He Cys Ser Val Lys Gly Arg Lys Gly Ala Lys 
405 410 415 

Glu Glu His Cys Pro Leu Ala Trp Gly Asn He Asn Leu Phe Asp Tyr 
420 425 430 

Thr Asp Thr Leu Val Ser Gly Lys Met Ala Leu Asn Leu Trp Pro Val 
435 440 445 

Pro His Gly Leu Glu Asp Leu Leu Asn Pro He Gly Val Thr Gly Ser 
450 455 460 

Asn Pro Asn Lys Glu Thr Pro Cys Leu Glu Leu Glu Phe Asp Trp Phe 
465 470 475 480 

Ser Ser Val Val Lys Phe Pro Asp Met Ser Val He Glu Glu His Ala 
485 490 495 

Asn Trp Ser Val Ser Arg Glu Ala Gly Phe Ser Tyr Ser His Ala Gly 
500 505 510 

Leu Ser Asn Arg Leu Ala Arg Asp Asn Glu Leu Arg Glu Asn Asp Lys 
515 520 525 

Glu Gin Leu Arg Ala He Cys Thr Arg Asp Pro Leu Ser Glu He Thr 
530 535 540 

Glu Gin Glu Lys Asp Phe Leu Trp Ser His Arg His Tyr Cys Val Thr 
545 550 555 560 

He Pro Glu He Leu Pro Lys Leu Leu Leu Ser Val Lys Trp Asn Ser 
565 570 575 

Arg Asp Glu Val Ala Gin Met Tyr Cys Leu Val Lys Asp Trp Pro Pro 
580 585 590 

He Lys Pro Glu Gin Ala Met Glu Leu Leu Asp Cys Asn Tyr Pro Asp 
595 600 605 

Pro Met Val Arg Gly Phe Ala Val Arg Cys Leu Glu Lys Tyr Leu Thr 
610 615 620 

Asp Asp Lys Leu Ser Gin Tyr Leu He Gin Leu Val Gin Val Leu Lys 
625 630 635 640 

Tyr Glu Gin Tyr Leu Asp Asn Leu Leu Val Arg Phe Leu Leu Lys Lys 
645 650 655 

Ala Leu Thr Asn Gin Arg He Gly His Phe Phe Phe Trp His Leu Lys 
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660 665 670 

Ser Glu Met His Asn Lys Thr Val Ser Gin Krg Phe Gly Leu Leu Leu 
.675 680 685 

Glu Ser Tyr Cys Arg Ala Cys Gly Met Tyr Leu Lys His Leu Asn Arg 
690 695 700 

Gin Val Glu Ala Met Glu Lys Leu lie Asn Leu Thr Asp lie Leu Lys 
705 710 715 720 

Gin Glu Lys Lys Asp Glu Thr Gin Lys Val Gin Met Lys Phe Leu Val 
725 730 735 

Glu Gin Met Arg Arg Pro Asp Phe Met Asp Ala Leu Gin Gly Phe Leu 
740 745 750 

Ser Pro Leu Asn Pro Ala His Gin Leu Gly Asn Leu Arg Leu Glu Glu 
755 760 765 

Cys Arg lie Met Ser Ser Ala Lys Arg Pro Leu Trp Leu Asn Trp Glu 
770 775 780 

Asn Pro Asp lie Met Ser Glu Leu His Phe Gin Asn Asn Glu lie lie 
785 790 795 800 

Phe Lys Asn Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin lie 
805 810 815 

lie Arg lie Met Glu Asn lie Trp Gin Asn Gin Gly Leu Asp Leu Arg 
820 825 830 

Met Leu Pro Tyr Gly Cys Leu Ser He Gly Asp Cys Val Gly Leu He 
835 840 845 

Glu Val Val Arg Asn Ser His Thr He Met Gin He Gin Cys Lys Gly 
850 855 860 

Gly Leu Lys Gly Ala Leu Gin Phe Asn Ser His Thr Leu His Gin Trp 
865 870 875 880 

Leu Lys Asp Lys Asn Lys Gly Glu He Tyr Asp Ala Ala He Asp Leu 
885 890 895 

Phe Thr Arg Ser Cys Ala Gly Tyr Cys Val Ala Thr Phe He Leu Gly 
900 905 910 

He Gly Asp Arg His Asn Ser Asn He Met Val Lys Asp Asp Gly Gin 
915 920 925 

Leu Phe His He Asp Phe Gly His Phe Leu Asp His Lys Lys Lys Lys 
930 935 940 

Phe Gly Tyr Lys Arg Glu Arg Val Pro Phe Val Leu Thr Gin Asp Phe 
945 950 955 960 

Leu He Val He Ser Lys Gly Ala Gin Glu Cys Thr Lys Thr Arg Glu 
965 970 975 

Phe Glu Arg Phe Gin Glu Met Cys Tyr Lys Ala Tyr Leu Ala He Arg 
980 985 990 

Gin His Ala Asn Leu Phe He Asn Leu Phe Ser Met Met Leu Gly Ser 
995 1000 1005 

Gly Met Pro Glu Leu Gin Ser Phe Asp Asp He Ala Tyr He Arg Lys 
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1010 1015 1020 

Thr Leu Ala Leu Asp Lys Thr Glu Gin Glu Ala Leu Glu Tyr Phe Met 
1025 1030 1035 1041 

Lys Gin Met Asn Asp Ala His His Gly Gly Trp Thr Thr Lys Met Asp 
1045 1050 1055 

Trp lie Phe His Thr lie Lys Gin His Ala Leu Asn 
1060 1065 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2199 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: SCHI20SACCHAR0MYCES POMBE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AAAAATCTCA ACACATGTGA ATGATCAGAA AATTATCGCC ATAAAAGACA GAATAAGTCA 60 

TCAGCGGTTG TTTCATTTCC TATATTTTTT TTTTATTTTT TTATTTTTTA ATAAGGGAAA 120 

ATTTAACGTC TAAGGATACA GAAGATTGTT AGCACATTAA AGTAATAAAG GCTTAAGTAG 180 

TAAGTGCCTT AGCATGTTAT TGTATTTCAA AGGACATAAT CTAAAATAAT AACAATATCA 240 

TTTCTCACAA GTTATTCAAT TTTCTTTTTT TTTTCTAATA ATATCAAGAA TGTATTATTT 300 

GTTTGACATA AGTCAACTAA TTTATTTAAT ATGCTGGATT AATCTTGCAG ACATGTAAAT 360 

TAACAAGTTT TAGTCAAATA ACGTTGAAGT TTCAATGAAC TCAAATAATT TCTCTTTTTT 420 

TTTATATAAC CATATGTCTA ATCTGATTTA TATTTTCCGC AGGATCAACT GAAGTTATGA 480 

CATTTGGATT GGATCACTTA TAACCTTGGT CGCCAAATAA TACAAAAATC AGCGTTATAA 540 

AACAAAGAAG GTTTTTGTTA AGAAATTAAT CCTCTTTCTT GATAAGAAAG TTGAACCGAA 600 

ATTGCAGATA CTGATATATG AAAATAATAC CCACAATTTT GGGT^TAGCG CAAGCCTCAA 660 

TTTAAACAAT AGGTGAGGAC ACATGATAAT GACCTCAATG ATTGTTAGAA GAAAAGAGCC 720 

TCATTACAAA ATCGAAAAAT GAATGGTTGG GTACAAGTTT CCAAAACATG GTAAAGTGGA 780 

CTTTGCGTAT GAGACGTAAA TAGAAAAAAA CACTTGTTAT ATGTTTTCTA GAATTATTGT 840 

TGTCTCTTTA TGGTTGGATG ATGCAAAATA GTAATTTCGG TTAGTTGCTG TAAAACACCA 900 

CGAGACAAAT AGATATGGAT ATTTATTAAA TCAGGAAAAA CGTAACTCTC GGCTACTGGA 960 

TGGTTCAGTC ACCCAACGAT TACTGGGGAG AGAAAACAGG GCAAAAGCAA AGCTTAAAGG 1020 

AATCCGATTG TCATTCGGCA ATGTGCAGCG AAACTAAAAA CCGGATAATG GACCTGTTAA 1080 
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TCGAAACATT 


GAAGATATAT 


AAAGGAAGAG 


GAATCCTGGC 


ATATCATCAA 


TTGAATAAGT 


1140 


TGAATTAATT 


ATTTCAATCT 


CATTCTCACT 


TTCTGACTTA 


TAGTCGCTTT 


GTTAAATCAT 


1200 


AGGAATGTCT 


CCCTTGCCAG 


TACTGCTAGG 


GTTTXTCTTT 


CAAACTATGG 


AAGCCCATTC 


1260 


AAGCTGCATA 


TTACGATTTT 


GTTTTTCGCT 


TTTAGAAAGT 


GGTTTAGATG 


AGATAATAGA 


1320 


AAAATTCTTG 


ATCTCCGACA 


ACGAGTACTT 


TTATTTTTTT 


TGCTAATCAC 


TTTACTCAAT 


1380 


ATTAGCTCGA 


AATCGTAGAA 


ACGTAGACGG 


GTGCGGGATA 


CCGAGTGGTG 


TAGTTAAGAA 


1440 


TTTTTATAAA 


CCACGTGGCC 


CAAAAATATG 


AACCCAAAAC 


GTTTATACAT 


GAGTATACTT 


1500 


TAAGAAGGCT 


ATACCCCTTC 


GTGTTAGATG 


TAGTTTTAGC 


TACCCAACCC 


GAGTCTATGA 


1560 


GCTTGACTTC 


AGATGTAGAA 


GGCATTAAAT 


CGTTTTGAAT 


ATTAATTAAA 


AAACGATGAA 


1620 


AATTAAATAT 


TTAAAAGCAA 


TCATACGCTG 


AAAATTTAGT 


GCTGTGGCTA 


ATCCTTCAAC 


1680 


ATGGAAATGC 


CATAAAAGTG 


ACTTTGACAA 


AAAAAAAAGT 


ATATACAGGT 


AGTAAACTCA 


1740 


TCTACTTCAT 


TGACTTTGTT 


TACAGCATGT 


GGAAGGAGGA 


ATATTTATTG 


CTAAATCGTA 


1800 


GTTTAACATT 


CAATAAGTAA 


TACTATTGAA 


ATTCGACAAG 


ATTGGCCGCA 


TGGATGAAAA 


1860 


AGAGGCATTT 


TGCTTTGGGA 


GAATTAGTTC AAATTAGAAC 


TGAAAAAAAA 


AACTTTACGA 


1920 


GGCAAAAATG 


TCGGATTGAG 


ATCGTAAAAG 


TTCGCTCGTC 


GTCTTTTGCT 


TTGTGATTGT 


1980 


TTTCATGGAT 


ACATCTTGCT 


GGATATTTAA 


ATTTTAGTAC 


TATGTATAAG 


ATATTCTATA 


2040 


AATGTTTTAT 


CACCCAAACC 


TGTTAGCGCC 


TTCTTAATTC 


TATTCAATCT 


GGCTTTTGCT 


2100 


CTGAGACTAC 


TTCTTGGACT 


TTCACTACTT 


GTTAGTTATA 


CGGAATTTGT 


GTAATTAGAA 


2160 


GTGAAATAAT 


CCTTTCTATT 


AGTAATGCAA 


ACAAAAATC 






2199 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



CTCGAGCTGA 


AGAACCAGCG 


AGGCGGCGAG 


GCAGCCCCCG 


CGGCTTGCAG 


CGGAGGCGAC 


60 


AGCTCGTCTC 


CTGCCGTGGA 


GGTGTCGCCG 


GTGGTGGGGG 


GGAGAGACTT 


GCTCCAAAAA 


120 


AACGGACGTC 


TCCAGCTCTC 


CCCCCTCCCT 


GTTTTCCGTT 


AGGAATCCGG 


CGAGGAAATA 


180 


CATGCACTCG 


CTGAGAATCG 


GCGGCGCCAG 


GAGGCAGCGC 


CACAAGGTGT 


AGCGAGTGAG 


240 


TGGGGTGGGG 


CAAGAGGGGA 


CCCAGGAGTC 


CCCCAGGCTC 


CCGGCGCGCC 


TGCTCCTGCT 


300 
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CTTCAATCCT 


GCCCACGGGG 


CGGACGGAGT 


GACCCCCGCC 


CCGACCATGG 


TAGTGTTCAA 


360 


TGGCCTTCTT 


AAGATCAAAA 


TCTGCGAGGC 


GGTGAGCTTG 


AAGCCCACAG 


CCTGGTCGCT 


420 


GCGCCATGCG 


GTGGGACCCC 


GGCCACAGAC 


GTTCCTTTTG 


GACCCCTACA 


TTGCCCTTAA 


480 


CGTGGACGAC 


TCGCGCATCG 


GCCAAACAGC 


CACCAAGCAA 


AAGACCAACA 


GCCCGGCCTG 


540 


GCACGATGAG 


TTCGTCACCG 


ATGTGTGCAA 


TGGGCGCAAG 


ATCGAGCTGG 


CTGTCTTTCA 


600 


CGACGCTCCT 


ATCGGCTACG 


ACGACTTCGT 


GGCCAACTGC 


ACCATCCAGT 


TCGAGGAGCT 


660 


GCTGCAGAAT 


GGGAGCCGTC 


ACTTCGAGGA 


CTGGATTGAC 


CTGGAGCCAG 


AAGGAAAAGT 


720 


GTACGTGATC 


ATCGATCTCT 


CGGGATCATC 


GGGTGAAGCC 


CCTAAAGACA 


ATGAAGAACG 


780 


AGTGTTCAGG 


GAGCGTATGC 


GGCCAAGGAA 


GCGGCAAGGG 


GCTGTCAGGC 


GCAGGGTCCA 


840 


CCAGGTCAAT 


GGCCACAAGT 


TCATGGCCAC 


CTACTTGCGG 


CAACCCACCT 


ACTGCTCCCA 


900 


CTGCAGAGAT 


TTCATCTGGG 


GTGTCATAGG 


AAAACAGGGA 


TATCAATGTC 


AAGTTTGCAC 


960 


TTGCGTTGTC 


CACAAGCGAT 


GTCATGAGCT 


CATTATTACA 


AAGTGCGCTG 


GGCTGAAGAA 


1020 


ACAGGAAACC 


CCTGACGAGG 


TGGGCTCCCA 


ACGGTTCAGC 


GTCAACATGC 


CCCACAAGTT 


1080 


CGGGATCCAC 


AACTACAAGG 


TCCCCACGTT 


CTGTGACCAC 


TGTGGGTCCC 


TGCTCTGGGG 


1140 


CCTCTTGCGG 


CAGGGCTTGC 


AGTGTAAAGT 


CTGCAAAATG 


AATGTTCACC 


GGCGATGTGA 


1200 


GACCAACGTG 


GCTCCCAACT 


GTGGGGTAGA 


CGCCAGAGGA 


ATTGCCAAAG 


TGCTGGCTGA 


1260 


CCTCGGTGTT 


ACTCCAGACA 


AAATCACCAA 


CAGTGGCCAA 


AGGAGGAAAA 


AGCTCGCTGC 


1320 


TGGTGCTGAG 


TCCCCACAGC 


CGGCTTCTGG 


AAACTCCCCA 


TCTGAAGACG 


ACCGATCCAA 


1380 


GTCAGCGCCC 


ACCTCCCCTT 


GTGACCAGGA 


ACTAAAAGAA 


CTTGAT^CA 


ACATCCGGAA 


1440 


GGCCTTGTCA 


TTTGACAACC 


GAGGAGAGGA 


GCACCGAGCG 


TCGTCGGCCA 


CCGATGGCCA 


1500 


GCTGGCAAGC 


CCCGGAGAGA 


ATGGGGAAGT 


CCGGCCAGGC 


CAGGCCAAGC 


GCTTGGGGCT 


1560 


GGATGAGTTC 


AACTTCATCA 


AAGTGTTGGG 


CAAAGGCAGC 


TTTGGCAAGG 


TCATGTTGGC 


1620 


GGAACTCAAA 


GGCAAAGATG 


AAGTCTACGC 


TGTGAAGGTC 


TTGAAGAAGG 


ACGTTATCCT 


1680 


ACAAGACGAT 


GATGTGGACT 


GCACAATGAC 


AGAGAAGAGG 


ATTTTGGCTC 


TGGCTCGGAA 


1740 


ACACCCTTAT 


CTAACCCAAC 


TCTATTGCTG 


CTTCCAGACC 


AAGGACCGCC 


TCTTCTTCGT 


1800 


CATGGAATAT 


GTAAATGGTG 


GAGACCTCAT 


GTTCCAGATT 


CAGCGGTCCC 


GAAAATTTGA 


1860 


TGAGCCTCGT 


TCTCGGTTCT 


ATGCCGCAGA 


GGTCACATCG 


GCCCTCATGT 


TTCTCCACCA 


1920 


GCATGGAGTG 


ATCTACAGGG 


ATTTGAAACT 


GGACAACATC 


CTTCTAGATG 


CAGAAGGCCA 


1980 


CTGCAAGCTG 


GCTGACTTTG 


GGATGTGCAA 


GGAAGGGATT 


ATGAATGGTG 


TGACAACTAC 


2040 


CACCTTCTGT. 


GGGACTCCTG 


ACTACATAGC 


TCCAGAGATC 


CTACAGGAGT 


TGGAGTACGG 


2100 


CCCCTCAGTG 


GACTGGTGGG 


CCCTGGGGGT 


GCTGATGTAC 


GAGATGATGG 


CTGGGCAGCC 


2160 


CCCCTTTGAA 


GCTGACAACG 


AGGACGACTT 


GTTCGAATCC 


ATCCTTCATG 


ATGATGTTCT 


2220 


CTATCCTGTC 


TGGCTTAGCA 


AGGAAGCTGT 


CAGCATCCTG 


AAAGCTTTCA 


TGACCAAGAA 


2280 
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CCCGCACAAG CGCCTGGGCT GTGTGGCAGC GCAGAACGGG GAGGACGCCA TCAAGCAACA 2340 

TCCATTCTTC AAGGAGATTG ACTGGGTACT GCTGGAGCAG AAGAAAATCA AGCCCCCCTT 2400 

CAAGCCGAGA ATTAAAACCA AAAGAGATGT CAATAACTTT GACCAAGACT TTACGCGGGA 2460 

AGAGCCAATA CTTACACTTG TGGATGAAGC AATCATTAAG CAGATCAACC AGGAAGAATT 2520 

CAAAGGCTTC TCCTACTTTG GTGAAGACCT GATGCCCTGA GAGGCTGCTT CGGATGGAGG 2580 

GAGCTCATGC TGCAAGGACG GTGTTGAGAT ACTCCCAAGC TGCAGAGGCT CCGAAGGTCT 2640 

CAACTCCTCC TCCTCCTCCC CCTCCCCAGA GCCCCAGTCC CATGTCCACT CTCTTATTTA 2700 

TTGCATT 2707 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GGCCCCTGTT 


CTGCAGAAAG 


GGGGCTCTGA 


GGCAGAAGGT 


GGTCCATGAG 


GTCAAGAGCC 


60 


ACAAGTTCAC 


CGCTCGCTTC 


TTCAAGCAGC 


CGACCTTCTG 


CAGCCACTGC 


ACTGACTTCA 


120 


TATGGGGGAT 


TGGAAAACAG 


GGTCTGCAAT 


GTCAAGTCTG 


CAGTTTTGTG 


GTTCATCGAC 


180 


GATGCCACGA 


GTTTGTGACC 


TTCGAGTGTC 


CAGGCGCTGG 


GAAGGGCCCC 


CAGACGGACG 


240 


ATCCCCGGAA 


CAAGCACAAG 


TTCCGTCTGC 


ACAGCTACAG 


CAGCCCCACC 


TTCTGCGACC 


300 


ACTGTGGCTC 


CCTGCTCTAC 


GGGCTGGTGC 


ACCAGGGCAT 


GAAGTGTTCT 


TGCTGCGAGA 


360 


TGAACGTGCA 


CCGGCGCTGT 


GTGCGCAGCG 


TGCCCTCTCT 


GTGCGGCGTG 


GACCACACGG 


420 


AGCGCCGGGG 


CCGCCTGCAG 


CTGGAGATCC 


GGGCGCCCAC 


TTCCGATGAG 


ATCCACGTTA 


480 


CGGTTGGCGA 


GGCCCGGAAC 


CTCATCCCAA 


TGGACCCCAA 


CGGTCTCTCC 


GATCCCTATG 


540 


TGAAGCTGAA 


GCTCATCCCA 


GACCCTCGGA 


ATTTGACCAA 


GCAGAAGACC 


CGCACGGTGA 


600 


AAGCTACGCT 


AAACCCTGTG 


TGGAACGAGA 


CCTTTGTGTT 


CAACCTGAAG 


CCGGGGGACG 


660 


TGGAGCGCCG 


GCTCAGCGTG 


GAGGTGTGGG 


ACTGGGACCG 


GACCTCCCGA 


AACGACTTCA 


720 


TGGGCGCCAT 


GTCCTTCGGC 


GTCTCGGAGC 


TGCTCAAGGC 


GCCGGTGGAC 


GGCTGGTACA 


780 


AGTTACTGAA 


CCAGGAGGAG 


GGCGAGTATT 


ACAATGTGCC 


GGTGGCTGAC 


GCCGACAACT 


840 


GCAACCTCCT 


CCAGAAGTTC 


GAGGCCTGTA 


ACTACCCCCT 


GGAACTATAC 


GAGAGGGTGC 


900 


GGACGGGTCC 


CTCTTCATCT 


CCCATCCCCT 


CCCCATCCCC 


CAGTCCCACC 


GACTCCAAGC 


960 
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GCTGTTTCTT CGGGGCCAGC CCTGGACGAC TGCACATCTC 
TTCTAGGAAA AGGCAGTTTT GGGAAGGTGA TGCTGGCCGA 
TCTACGCCAT CAAGATCCTG AAGAAAGACG TGATCGTCCA 
CCCTGGTGGA GAAACGCGTG CTGGCTCTGG GGGGCCGAGG 
TCCTCACCCA GCTTCACTCC ACCTTCCAGA CCCCGGATCG 
ATGTCACCGG GGGCGACTTG ATGTACCACA TTCAACAGCT 
ACGCAGCGTT CTACGCTGCA GAAATCGCCA TCGGCCTCTT 
TTATCTATCG GGACCTGAAA CTGGACAACG TGATGCTGGA 
TCACCGACTT CGGCATGTGT AAGGAGAACG TCTTTCCCGG 
GCGGGACCCC GGACTACATA GCCCCCGAGA TCATTGCCTA 
TGGATTGGTG GTCCTTTGGG GTTCTGCTCT ACGAGATGTT 
ATGGAGAAGA TGAGGAGGAG CTGTTTCAAG CCATCATGGA 
AGTCGCTTTC CCGGGAAGCT GTGGCCATCT GCAAGGGGTT 
AGCGCCTGGG CTCAGGCCCC GATGGAGAGC CCACCATCCG 
GGATCGACTG GGACAGGCTG GAACGATTAG AGATCGCGCC 
GTGGCCGCAG CGGCGAGAAC TTCGACAAGT TCTTCACTCiS 
CCCCTGACCG CCTGGTTCTG GCCAGCATCG ACCAGGCTGA 
TCAACCCGGA TTTCGTGCAC CCGGATGCCC GCAGCCCCAT 
TCATGTAATC CCACCTGCCG CCACCAGGCG TCCCCACGGC 
TGGCCCTCGC CTCACCATGC CACCCGCCTT TCCAATTCTA 
TGGCCTC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:' 2891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) -MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



CGACTTCAGC 
GCGCCGGGGC 
GGATGACGAC 
CCCGGGAGGC 
CCTGTATTTT 
GGGCAAGTTT 
CTTCCTCCAT 
TGCCGAAGGA 
GAGTACCACT 
CCAACCCTAT 
GGCAGGACAG 
ACAAACTGTC 
CCTCACCAAG 
CGCTCACGGC 
TCCGTTCAGA 
GGCGGCGCCG 
GTTCCAGGGC 
CAGCCCAACG 
TCCCTCCTCC 
GATATGGCTC 



TTCCTCATGG 
TCCGATGAGC 
GTGGACTGCA 
CGGCCGCACT 
GTGATGGAGT 
AAGGAACCCC 
AACCAGGGCA 
CACATCAAAA 
CGCACCTTCT 
GGGAAGTCTG 
CCCCCCTTTG 
ACCTACCCCA 
CACCCGGCCA 
TTTTTCCGCT 
CCCCGCCCGT 
GCGCTGACAC 
TTCACCTATG 
CCTGTGCCAG 
GCCCCGGCTT 
CCCAGCGTTC 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2167 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGCGGCGGCC GCGGGGATCC CGCGAGCGGC CCCTGAACAT CTACCCTTCT TGCCGGGACC 
CGGGAGGTCC CCACTGGCCT CCGGGCCCGT CCTGATCAGA CTCGTGTCGA CCTCCCCGTC 
CACGCGCATC CGGGAGAGCC GCGCCACGAG ACGGACCCGG GCCCGCCGGG ACCCCTGGTG 



60 
120 
180 
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TCTGGCCCTG 


CGTCGAGAGG 


CTGGTGACTG 


CCACCCATAA 


GCTCCAGCTT 


CAGCCTCGGC 


240 


TTACTCCCCT 


CAGGGGCTTG 


CAGGCTGAGG 


CCTGCCCTCG 


GACGCGGCTG 


ACCAGCCTCT 


300 


CCCTCTCTTC 


CACACTTTGG 


ACTTCTCTTT 


GGACCTCCTA 


AAAAGGCTCC 


ATCATGGCAC 


360 


CGTTCCTGCG 


CATCTCCTTC 


AATTCCTATG 


AGCTGGGCTC 


CCTGCAGGCG 


GAGGACGACG 


420 


CAAGCCAGCC 


TTTCTGTGCC 


GTGAAGATGA 


AGGAGGCACT 


CACCACAGAC 


CGAGGGAAGA 


480 


CTCTGGTACA 


GAAGAAGCCC 


ACAATGTACC 


CTGAGTGGAA 


GTCAACATTC 


GACGCCCACA 


540 


TCTATGAAGG 


CCGTGTCATC 


CAGATCGTGC 


TGATGCGGGC 


AGCTGAAGAC 


CCCATGTCGG 


600 


AGGTGACCGT 


GGGCGTGTCA 


GTGCTGGCTG 


AGCGCTGCAA 


GAAGAACAAC 


GGCAAGGCTG 


660 


AGTTCTGGCT 


GGACCTGCAG 


CCTCAGGCCA 


AGGTGCTGAT 


GTGTGTGCAG 


TATTTCCTGG 


720 


AGGATGGGGA 


TTGCAAACAG 


TCCATGCGTA 


GTGAGGAGGA 


GGCCATGTTC 


CCAACTATGA 


780 


ACCGCCGTGG 


AGCCATTAAA 


CAGGCCAAGA 


TTCACTACAT 


CAAGAACCAC 


GAGTTCATCG 


840 


CCACCTTCTT 


TGGGCAGCCC 


ACCTTCTGTT 


CTGTGTGCAA 


AGAGTTTGTC 


TGGGGCCTCA 


900 


ACAAGCAAGG 


CTACAAATGC 


AGGCAATGCA 


ACGCTGCCAT 


CCATAAGAAA 


TGCATCGACA 


960 


AGATTATCGG 


CCGCTGCACT 


GGCACTGCTA 


CCAATAGCCG 


GGACACCATC 


TTCCAGAAAG 


1020 


AACGCTTCAA 


CATCGACATG 


CCTCACCGAT 


TCAAGGTCTA 


TAACTACATG 


AGCCCCACCT 


1080 


TCTGTGACCA 


CTGTGGCACT 


TTGCTCTGGG 


GATTGGTGAA 


ACAGGGATTA 


AAGTGTGAAG 


1140 


ACTGCGGCAT 


GAATGTGCAC 


CACAAATGCC 


GGGAGAAGGT 


GGCCAACCTG 


TGTGGTATCA 


lioo 


ACCAAAAGCT 


CTTGGCTGAG 


GCCTTGAACC 


AAGTGACCCA 


GAAAGCTTCC 


CGGAAGCCAG 


1260 


AGACACCAGA 


GACTGTCGGA 


ATATACCAGG 


GATTCGAGAA 


GAAGACAGCT 


GTCTCTGGGA 


1320 


ATGACATCCC 


AGACAACAAC 


GGGACCTATG 


GCAAGATCTG 


GGAGGGGAGC 


AACCGGTGCC 


1380 


GCCTTGAGAA 


CTTCACCTTC 


CAGAAAGTAC 


TTGGCAAAGG 


CAGCTTTGGC 


AAGGTACTGC 


1440 


TTGCAGAACT 


GAAGGGCAAG 


GAAAGGTACT 


TTGCAATCAA 


GTACCTGAAG 


AAGGACGTGG 


1500 


TGTTGATCGA 


CGATGACGTG 


GAGTGCACCA 


TGGTGGAGAA 


GCGGGTGCTG 


GCGCTCGCCT 


1560 


GGGAGAATCC 


CTTCCTCACC 


CATCTCATCT 


GTACCTTCCA 


GACCAAGGAC 


CACCTCTTCT 


1620 


TTGTGATGGA 


GTTCCTCAAT 


GGGGGCGATC 


TGATGTTCCA 


CATTCAGGAC 


AAAGGCCGCT 


1680 


TCGAACTCTA 


CCGGGCTACG 


TTTTATGCAG 


CTGAGATCAT 


CTGCGGACTG. 


CAGTTTCTAC 


1740 


ATGGCAAAGG 


CATCATTTAC 


AGGGACCTCA 


AGCTAGACAA 


TGT7UVTGCTG 


GACAAGGATG 


1800 


GCCACATCAA 


GATTGCTGAC 


TTCGGGATGT 


GCAAAGAGAA 


TATATTTGGG 


GAGAACCGGG 


1860 


CCAGCACATT 


CTGCGGCACT 


CCTGACTACA 


TCGCCCCTGA 


GATCCTGCAG 


GGCCTGAAGT 


1920 


ACTCATTTTC 


CGTGGACTGG 


TGGTCTTTTG 


GGGTCCTCCT 


CTATGAGATG 


CTCATTGGCC 


1980 


AGTCCCCCTT 


CCATGGTGAT 


GATGAGGACG 


AGCTCTTTGA 


GTCCATCCGG 


GTGGACACAC 


2040 


CACACTACCC 


GCGCTGGATC 


ACCAAGGAGT 


CCAAGGACAT 


CATGGAGAAG 


CTCTTCGAGA 


2100 


GGGACCCTGC 


CAAGAGGCTG 


GGAGTAACAG 


CAAACATCAG 


GCTTCACCCC 


TTTTTCAAGA 


2160 
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CTATCAACTG GAACCTGCTG GAGAAGCGGA AGGTGGAGCC 
AATCCCCTTC AGACTACAGC AACTTTGACC CAGAGTTCCT 
CCTTCAGTGA CAAGAACCTC ATCGACTCTA TGGACCAGAC 
TTGTGAACCC CAAATATGAG CAATTCCTGG AATAGTGAGC 
CCCCGGCAGA GTAGGCCCAT CTGCCCTGGT TTGCATCCTC 
GGTGACTGGT GATTCCTGCT GCTGCCCCCT CTTCCTCGGA 
GGGCTCACAG TACTTCCTCT GTGAACTGTT TGTGAATTTG 
GGAAACTGTA AATCCTGTGT GTCATTACTT GAATGTAGTT 
TATGCACATA TATATAATAG GCTGTATATA TTGCTCAGTA 
GGTGATGTGT TGACCTTTTT TTAAAAAAAA CCATATGTAT 
ACACACGTAT ACATATATGT ATGTATGTAT GTATGTATGT 
AAAGAGAGCA CAAGCTACCT GAACCACAGG ATTGTTTATG 
ATGGTAAAAA A 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



CCCCTTTAAG 
GAATGAGAAA 
AGCCTTCAAG 
TCCCAGACCT 
ACTGCCCATG 
GAGTCTGGCT 
CCTTCCTTTT 
ATTGAAATAT 
TAGAAAGCAT 
ACGTGTGTAT 
ATGTATATAT 
TGTGTATAAA 



CCCAAAGTGA 

CCCCAACTTT 

GGCTTCTCCT 

GCTTTTAATG 

AAGAAGAGTG 

CCTGTTGGCT 

GCCATCGGAG 

ATATTATATA 

GTAGGAGACT 

GTATACATCT 

GACCAAAAGA 

TAAACACTGA 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2891 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCCGGGTTCC CCAGTGCCAG CCAGCGCGGC CCCCTCGGGG 
TGTCGTCCGG CACGATGAAG TTCAATGGCT ATCTGAGGGT 
GGCTGCAGCC CACCCGCTGG TCCCTGCGGC ACTCGCTCTT 
TGGACCCCTA CCTGACGGTG AGCGTAGACC AGGTACGCGT 
AGAAGACCAA CAAACCCACC TACAACGAGG AGTTCTGCGC 
ACCTGGAGCT AGCCGTCTTC CACGAGACGC CCCTGGGTTA 
GCACGCTGCA GTTCCAGGAG CTGTTGCGCA CGGCTGGTAC 
GGGTGGATCT GGAGCCTGAG GGGAAAGTGT TTGTGGTAAT 
CTGAAGCCAC TCTCCAGAGA GACCGCATCT TCAAGCATTT 
CTATGCGAAG ACGAGTCCAT CAAGTGAACG GACATAAGTT 
AGCCCACCTA CTGCTCTCAT TGCCGAGAGT TCATCTGGGG 



CTCCGGCAGC 
CCGCATCGGA 
CAAAAAGGGC 
GGGCCAGACC 
CAATGTCACC 
TGACCACTTT 
CTCGGACACC 
AACCCTAACA 
TACCAGGAAG 
CATGGCCACG 
AGTATTTGGG 



AGCGCCGGCA 
GAGGCTGTAG 
CACCAGCTGC 
AGCACAAAGC 
GACGGCGGCC 
GTGGCCAACT 
TTCGAGGGCT 
GGGAGTTTCA 
CGCCAAAGGG 
TACCTGAGGC 
AAACAGGGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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ATCAATGCCA 


AGTGTGCACC 


TGCGTCGTCC 


ATAAACGCTG 


CCATCATCTA 


ATTGTTACAG 


720 


CCTGCACTTG 


CCAAAACAAT 


ATTAACAAAG 


TGGATGCCAA 


GATTGCAGAA 


CAGCGGTTTG 


780 


GCATCAACAT 


CCCACACAAG 


TTCAACGTTC 


ACAATTACAA 


GGTGCCCACG 


TTCTGTGACC 


840 


ACTGTGGCTC 


CCTGCTCTGG 


GGGATAATGC 


GACAAGGACT 


TCAGTGTAAA 


ATATGTAAGA 


900 


TGAATGTACA 


TATTCGGTGT 


CAGGCGAACG 


TGGCCCCAAA 


CTGCGGGGTG 


AATGCCGTGG 


960 


AGCTTGCCAA 


GACCCTGGCA 


GGGATGGGTC 


TCCAACCCGG 


AAATATTTCT 


CCAACCTCGA 


1020 


AACTCATTTC 


CAGGTCGACA 


CTAAGACGGC 


AGGGAAAGGA 


GGGCTCCAAA 


GAAGGAAATG 


1080 


GGATCGGTGT 


TAACTCTTCC 


AGCAGATTCG 


GCATCGACAA 


CTTTGAGTTC 


ATCCGGGTGT 


1140 


TGGGGAAGGG 


GAGCTTCGGG 


AAGGTGATGC 


TTGCCAGGAT 


AAAGGAGACA 


GGAGAACTGT 


1200 


ACGCCGTGAA 


GGTGCTGAAG 


AAGGACGTGA 


TTCTGCAGGA 


TGATGATGTA 


GAGTGCACCA 


1260 


TGACTGAGAA 


GAGGATCCTG 


TCCTTGGCTC 


GCAACCACCC 


CTTCCTCACC 


CAGCTCTTCT 


1320 


GCTGCTTTCA 


GACTCCAGAC 


CGTCTGTTCT 


TTGTCATGGA 


GTTTGTGAAC 


GGAGGCGACC 


1380 


TGATGTTCCA 


CATCCAAAAG 


TCCCGTCGTT 


TCGATGAAGC 


CCGTGCTCGT 


TTCTACGCCG 


1440 


CGGAGATCAT 


TTCTGCACTC 


ATGTTCCTAC 


ATGAGAAAGG 


TATCATCTAT 


AGAGACTTGA 


1500 


AACTGGACAA 


TGTGCTATTG 


GACCACGAAG 


GTCACTGTAA 


ACTGGCCGAT 


TTTGGAATGT 


1560 


GCAAGGAGGG 


GATTTGTAAT 


GGGGTCACCA 


CAGCCACCTT 


CTGCGGTACA 


CCTGACTACA 


1620 


TTGCCCCAGA 


GATCCTTCAG 


GAGATGCTGT 


ATGGACCTGC 


AGTAGACTGG 


TGGGCCATGG 


1680 


GCGTGTTGCT 


TTATGAGATG 


CTGTGCGGAC 


ATGCGCCCTT 


TGAGGCTGAA 


AATGAAGATG 


1740 


ACCTTTTTGA 


GGCCATACTG 


AATGATGAAG 


TCGTCTACCC 


CACCTGGCTC 


CATGAAGATG 


1800 


CCAGAGGGAT 


CCTCAAGTCT 


TTCATGACCA 


AGAACCCCAC 


CATGCGCTTG 


GGCAGCCTGA 


1860 


CTCAGGGAGG 


AGAGCATGAG 


ATCCTGAGAC 


ACCCTTTCTT 


TAAGGAAATC 


GACTGGGCCC 


1920 


AGTTGAACCA 


TCGCCAGTTA 


GAGCCGCCTT 


TCCGACCTAG 


AATCAAATCC 


CGAGAAGATG 


1980 


TCAGCAATTT 


TGACCCAGAC 


TTTATAAAAG 


AAGAGCCCGT 


CTTAACTCCG 


ATTGATGAGG 


2040 


GACATCTTCC 


TATGATTAAC 


CAGGATGAGT 


TTAGAAACTT 


TTCCTATGTG 


TCACCGGAAT. 


2100 


TGCAACTGTA 


GCCTTATGGG 


GAGTCAGAAC 


CAAAGGGGAA 


GGTGGATTTC 


TCCAGGAATT 


2160 


TCTTATGTGG 


GAATTC 










2176 


(2) INFORMATION FOR SEQ ID NO: 8: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
Met Asp Trp lie Phe His Thr 
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1 5 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AARATGGAYT GGATHTTYCA YAC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Asp Gly Gin Leu Phe His He Asp Phe Gly His Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE" TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GATGATGGCC ARCTGTTYCA YATWGAYTTT GGCCAYTT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AATTCACACA CTGGCATGCC GAT 23 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GACTCGAGTC GACATCGATT TTTTTTTTTT TTTTT 35 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TTTAAGCTTA GGCATTCTAA AGTCACTATC ATCCC 35 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GACTCGAGTC GACATCGA 18 



BNSOOCID: <WO 9403609A1 J_> 



wo 94/03609 



PCT/GB93/01651 



48 

CLAIMS 

1. A eukaryotic cell transformed with a DNA construct comprising a 
coding sequence encoding a polypeptide having the activity of a 

5 mammalian phospholipid kinase or a mammalian protein kinase 

activated by a phospholipid or its metabolite and regulatory elements to 
allow transcription of the coding sequence in the said cell wherein the 
regulatory elements include a repressible or inducible promoter and the 
expression of the said coding sequence is lethal or growth inhibitory to 

10 the cell. 

2. A cell according to Claim 1 wherein the cell is a yeast cell. 

3 . A cell according to Claim 2 wherein the yeast is Schizosaccharomyces. 

15 

4. A cell according to Claim 3 wherein the promoter is the nmt promoter. 

5. A Schizosaccharomyces cell transformed with a DNA construct 
comprising a coding sequence encoding a polypeptide having the 

20 activity of a mammalian phospholipid kinase or a mammalian protein 

kinase activated by a phospholipid or its metabolite and regulatory 
elements to allow transcription of the coding sequence in the said cell 
wherein the regulatory elements include a constitutive promoter and the 
expression of the said coding sequence is lethal or growth inhibitory to 

25 the cell. 

6. A Schizosaccharomyces cell according to Claim 5 wherein the promoter 
is the adh promoter. 



30 7. A cell according to any one of the preceding claims wherein the 
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phospholipid kinase is an inositol phospholipid kinase. 

8. A cell according to any one of Claims 1 to 6 wherein the protein kinase 
activated by a phospholipid or its metabolite is a protein kinase C. 

5 

9. A cell according to Claim 7 wherein the phospholipid kinase is selected 
from the group consisting of phosphatidyl inositol 3-kinase, 
phosphatidyl inositol 4-kinase and phosphatidyl inositol-5-kinase. 

10 10. A cell according to Claim 9 wherein the phospholipid kinase is 
phosphatidyl inositol 3-kinase. 

11. A cell according to Claim 8 wherein the protein kinase C is selected 
from any one of PKC-7, PKC-5, PKC-t; or PKC-e. 

15 

12. An assay for detecting whether a compound is involved in cell growth 
regulation, the assay comprising (1) a cell according to any one of the 
preceding claims, (2) a container for the said cell, (3) a growth medium 
for the said cell and (4) means to detect the viability of the cell. 

20 

13. A kit comprising a eukaryotic cell as defined in Claim 1 and culture 
medium such that the cell will divide and grow. 

14. A method for assaying for a compound that is involved in cell growth 
25 regulation the method comprising (1) culturing a cell as defined in 

Claim 1 , (2) adding a compound and (3) determining the cell growth 
rate in the presence of the compound. 

15. A compound identified by the assay of Claim 12 or the method of 
30 Claim 14 as being involved in cell growth regulation. 
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AAAAATCTCA 


ACACATGTGA 


ATGATCAGAA 


AATTATCGCC 


ATAAAAGACA 


GAATAAGTCA 


60 


TCAGCGGTTG 


TTTCATTTCC 


TATATTTTTT 


TTTTATTTTT 


TTATTTTTTA 


ATAAGGGAAA 


120 


ATTTAACGTC 


T\ TS ^ ^ TV TV ^ Th 

TAAGGATACA 


GAAGATTGTT 


AGCACATTAA 


AGTAATAAAG 


GCTTAAGTAG 


180 


TAAGTGCCTT 


AGCATGTTAT 


TGTATTTCAA 


AGGACATAAT 


CTAAAATAAT 


AACAATATCA 


240 


TTTCTCACAA 


GTTATTCAAT 


TTTCTTTTTT 


TTTTCTAATA 


ATATCAAGAA 


TGTATTATTT 


300 


GTTTGACATA 


AGTCAACTAA 


TTTATTTAAT 


ATGCTGGATT 


AATCTTGCAG 


ACATGTAAAT 


360 


T AAC AAG TTT 


TAGTCAAATA 


ACGTTGAAGT 


TTCAATGAAC 


TCAAATAATT 


TCTCTTTTTT 


420 


TTTATATAAC 


CATATGTCTA 


ATCTGATTTA 


TATTTTCCGC 


AGGATCAACT 


GAAGTTATGA 


480 


CATTTGGATT 


^n^Hiv ^^#nm Ti 

GGATCACTTA 


TAACCTTGGT 


CGCCAAATAA 


TACAAAAATC 


AGCGTTATAA 


540 


AAC AAA AAG 


GTTTTTGTTA 


AGAAATTAAT 


CCTCTTTCTT 


GATAAGAAAG 


TTGAACCGAA 


600 


Al IIjCAOAXA 


r^TT^r^ TV fT> TV rp TV 

CTGATATATG 


AAAATAATAC 


CCACAATTTT 


GGGAATAGCG 


CAAGCCTCAA 


660 


TTTAAACAAT 


AGGTGAGGAC 


ACATGATAAT 


GACCTCAATG 


ATTGTTAGAA 


GAAAAGAGCC 


720 


TCATTACAAA 


ATCGAAAAAT 


GAATGGTTGG 


GTACAAGTTT 


CCAAAACATG 


GTAAAGTGGA 


7.80 


CTTTGCGTAT 


GAGACGTAAA 


TAGAAAAAAA 


CACTTGTTAT 


ATGTTTTCTA 


GAATTATTGT 


840 


TGTCTCTTTA 


TGGTTGGATG 


ATGCAAAATA 


GTAATTTCGG 


TTAGTTGCTG 


TAAAACACCA 


900 


CGAGACAAAT 


AGATATGGAT 


ATTTATTAAA 


TCAGGAAAAA 


CGTAACTCTC 


GGCTACTGGA 


. 960 


TGGTTCAGTC 


ACCCAACGAT 


TACTGGGGAG 


AGAAAACAGG 


GCAAAAGCAA 


AGCTTAAAGG 


1020 


AATCCGATTG 


TCATTCGGCA 


ATGTGCAGCG 


AAACTAAAAA 


CCGGATAATG 


GACCTGTTAA 


1080 


T CG AAAC ATT 


GAAGATATAT 


AAAGGAAGAG 


GAATCCTGGC 


ATATCATCAA 


TTGAATAAGT 


1140 


±OAAl X AAl X 


ATTTCAATCT 


CATTCTCACT 


TTCTGACTTA 


TAGTCGCTTT 


GTTAAATCAT 


1200 


ALvVj AA X X 1 


UCUTTGCCAG 


TACTGCTAGG 


GTTTTTCTTT 


CAAACTATGG 


AAGCCCATTC 


1260 


AAGL- xGCAT A 


TTACGATTTT 


GTTTTTCGCT 


TTTAGAAAGT 


GGTTTAGATG 


AGATAATAGA 


1320 


AAAATTCTTG 


T\ rTl ^ ^ TV TV 

ATCTCCGACA 


ACGAGTACTT 


TTATTTTTTT 


TGCTAATCAC 


TTTACTCAAT 


1380 


ATT AG CTCG A 


AATCGTAGAA 


ACGTAGACGG 


GTGCGGGATA 


CCGAGTGGTG 


TAGTTAAGAA 


1440 


TTTTTATAAA 


CCACGTGGCC 


CAAAAATATG 


AACCCAAAAC 


GTTTATACAT 


GAGTATACTT 


1500 


TAAG AAGG CT 


ATACCCCTTC 


GTGTTAGATG 


TAGTTTTAGC 


TACCCAACCC 


GAGTCTATGA 


1560 


VjL>X XGACrXC 


AGA TGTAG AA 


O ^ TV rnni TV TV TV 

GGCATTAAAT 


CGTTTTGAAT 


ATTAATTAAA 


AAACGATGAA 


1620 


AAu T AAATAT 


ry> rt\ tv tv tvo^tv tv 

TT AAAAG C AA 


TCATACGCTG 


AAAATTTAGT 


GCTGTGGCTA 


ATCCTTCAAC 


1660 


ATGG AAATG C 


CATAAAAGTG 


ACTTTGACAA 


AAAAAAAAGT 


ATATACAGGT 


AGTAAACTCA 


1740 


TCTACTTCAT 


TGACTTTGTT 


TACAGCATGT 


GGAAGGAGGA 


ATATTTATTG 


CTAAATCGTA 


1800 


GTTTAACATT 


CAATAAGTAA 


TACTATTGAA 


ATTCGACAAG 


ATTGGCCGCA 


TGGATGAAAA 


1860 


AGAGGCATTT 


TGCTTTGGGA 


GAATTAGTTC 


AAATTAGAAC 


TGAAAAAAAA 


AACTTTACGA 


1920 


GGCAAAAATG 


TCGGATTGAG 


ATCGTAAAAG 


TTCGCTCGTC 


GTCTTTTGCT 


TTGTGATTGT 


1980 


TTTCATGGAT 


ACATCTTGCT 


GGATATTTAA 


ATTTTAGTAC 


TATGTATAAG 


ATATTCTATA 


2040 
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AATGTTTTAT CACCCAAACC TGTTAGCGCC TTCTTAATTC TATTCAATCT GGCTTTTGCT 2100 

CTGAGACTAC TTCTTGGACT TTCACTACTT GTTAGTTATA CGGAATTTGT GTAATTAGAA 2160 

GTGAAATAAT CCTTTCTATT AGTAATGCAA ACAAAAATC 2199 
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CTCGAGCTGA AGAACCAGCG AGGCGGCGAG GCAGCCCCCG CGGCTTGCAG CGGAGGCGAC 60 

AGCTCGTCTC CTGCCGTGGA GGTGTCGCCG GTGGTGGGGG GGAGAGACTT GCTCCAAAAA 120 

AACGGACGTC TCCAGCTCTC CCCCCTCCCT GTTTTCCGTT AGGAATCCGG CGAGGAAATA 180 

CATGCACTCG CTGAGAATCG GCGGCGCCAG GAGGCAGCGC CACAAGGTGT AGCGAGTGAG 240 

TGGGGTGGGG CAAGAGGGGA CCCAGGAGTC CCCCAGGCTC CCGGCGCGCC TGCTCCTGCT 300 

CTTCAATCCT GCCCACGGGG CGGACGGAGT GACCCCCGCC CCGACCATGG TAGTGTTCAA 360 

TGGCCTTCTT AAGATCAAAA TCTGCGAGGC GGTGAGCTTG AAGCCCACAG CCTGGTCGCT 420 

GCGCCATGCG GTGGGACCCC GGCCACAGAC GTTCCTTTTG GACCCCTACA TTGCCCTTAA 480 
CGTGGACGAC TCGCGCATCG GCCAAACAGC CACCAAGCAA AAGACCAACA GCCCGGCCTG ' 540 

GCACGATGAG TTCGTCACCG ATGTGTGCAA TGGGCGCAAG ATCGAGCTGG CTGTCTTTCA 600 

CGACGCTCCT ATCGGCTACG ACGACTTCGT GGCCAACTGC ACCATCCAGT TCGAGGAGCT 660 

GCTGCAGAAT GGGAGCCGTC ACTTCGAGGA CTGGATTGAC CTGGAGCCAG AAGGAAAAGT 720 

GTACGTGATC ATCGATCTCT CGGGATCATC GGGTGAAGCC CCTAAAGACA ATGAAGAACG 780 

AGTGTTCAGG GAGCGTATGC GGCCAAGGAA GCGGCAAGGG GCTGTCAGGC GCAGGGTCCA 840 

CCAGGTCAAT GGCCACAAGT TCATGGCCAC CTACTTGCGG CAACCCACCT ACTGCTCCCA 900 

CTGCAGAGAT TTCATCTGGG GTGTCATAGG AAAACAGGGA TATCAATGTC AAGTTTGCAC 960 

TTGCGTTGTC CACAAGCGAT GTCATGAGCT CATTATTACA AAGTGCGCTG GGCTGAAGAA 1020 

ACAGGAAACC CCTGACGAGG TGGGCTCCCA ACGGTTCAGC GTCAACATGC CCCACAAGTT 1080 

CGGGATCCAC AACTACAAGG TCCCCACGTT CTGTGACCAC TGTGGGTCCC TGCTCTGGGG 1140 

CCTCTTGCGG CAGGGCTTGC AGTGTAAAGT CTGCAAAATG AATGTTCACC GGCGATGTGA 1200 

GACCAACGTG GCTCCCAACT GTGGGGTAGA CGCCAGAGGA ATTGCCAAAG TGCTGGCTGA 1260 

CCTCGGTGTT ACTCCAGACA AAATCACCAA CAGTGGCCAA AGGAGGAAAA AGCTCGCTGC 1320 

TGGTGCTGAG TCCCCACAGC CGGCTTCTGG AAACTCCCCA TCTGAAGACG ACCGATCCAA 1380 

GTCAGCGCCC ACCTCCCCTT GTGACCAGGA ACTAAAAGAA CTTGAAAACA ACATCCGGAA 1440 

GGCCTTGTCA TTTGACAACC GAGGAGAGGA GCACCGAGCG TCGTCGGCCA CCGATGGCCA 1500 

GCTGGCAAGC CCCGGAGAGA ATGGGGAAGT CCGGCCAGGC CAGGCCAAGC GCTTGGGGCT 1560 

GGATGAGTTC AACTTCATCA AAGTGTTGGG CAAAGGCAGC TTTGGCAAGG TCATGTTGGC 1620 

GGAACTCAAA GGCAAAGATG AAGTCTACGC TGTGAAGGTC TTGAAGAAGG ACGTTATCCT 1680 

ACAAGACGAT GATGTGGACT GCACAATGAC AGAGAAGAGG ATTTTGGCTC TGGCTCGGAA 1740 

ACACCCTTAT CTAACCCAAC TCTATTGCTG CTTCCAGACC AAGGACCGCC TCTTCTTCGT 1800 

CATGGAATAT GTAAATGGTG GAGACCTCAT GTTCCAGATT CAGCGGTCCC GAAAATTTGA 1860 

TGAGCCTCGT TCTCGGTTCT ATGCCGCAGA GGTCACATCG GCCCTCATGT TTCTCCACCA 1920 

GCATGGAGTG ATCTACAGGG ATTTGAAACT GGACAACATC CTTCTAGATG CAGAAGGCCA 1980 

CTGCAAGCTG GCTGACTTTG GGATGTGCAA GGAAGGGATT ATGAATGGTG TGACAACTAC 2040 

Figure 3; page 1 of 2 
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CACCTTCTGT 


gggactcctg 


ACTACATAGC 


TCCAGAGATC 


CTACAGGAGT 


TGGAGTACGG 


2100 


CCCCTCAGTG 


gactggtggg 


CCCTGGGGGT 


GCTGATGTAC 


GAGATGATGG 


CTGGGCAGCC 


2160 


CCCCTTTGAA 


gctgacaacg 


AGGACGACTT 


GTTCGAATCC 


ATCCTTCATG 


ATGATGTTCT 


2220 


CTATCCTGTC 


tggcttagca 


AGGAAGCTGT 


CAGCATCCTG 


AAAGCTTTCA 


TGACCAAGAA 


2280 


cccgcac:aag 


cgcctgggct 


GTGTGGCAGC 


GCAGAACGGG 


GAGGAC6CCA 


TCAAGCAACA 


2340 


TCCATTCTTC 


aaggagattg 


ACTGGGTACT 


GCTGGAGCAG 


AAGAAAATCA 


AGCCCCCCTT 


2400 


caagccgaga 


ATTAAAACCA 


AAAGAGATGT 


CAATAACTTT 


GACCAAGACT 


TTACGCGGGA 


2460 




CTTACACTTG 


TGGATGAAGn 


AATCATTAAG 


C AG AT C AACC 


AGGAAGAATT 


2520 


caaaggcttc 


tcctactttg 


GTGAAGACCT 


GATGCCCTGA 


GAGGCTGCTT 


CGGATGGAGG , 


2580 


gagctcatgc 


tgcaaggacg 


GTGTTGAGAT 


ACTCCCAAGC 


TGCAGAGGCT 


CCGAAGGTCT 


2640 


CAACTCCTCC 


TCCTCCTCCC 


CCTCCCCAGA 


GCCCCAGTCC 


CATGTCCACT 


CTCTTATTTA 
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GGCCCCTGTT CTGCAGAAAG GGGGCTCTGA GGCAGAAGGT GGTCCATGAG GTCAAGAGCC 60 

ACAAGTTCAC CGCTCGCTTC TTCAAGCAGC CGACCTTCTG CAGCCACTGC ACTGACTTCA 120 

TATGGGGGAT TGGAAAACAG GGTCTGCAAT GTCAAGTCTG CAGTTTTGTG GTTCATCGAC 180 

GATGCCACGA GTTTGTGACC TTCGAGTGTC CAGGCGCTGG GAAGGGCCCC CAGACGGACG 240 

ATCCCCGGAA CAAGCACAAG TTCCGTCTGC ACAGCTACAG CAGCCCCACC TTCTGCGACC 300 

ACTGTGGCTC CCTGCTCTAC GGGCTGGTGC ACCAGGGCAT GAAGTGTTCT TGCTGCGAGA 360 

TGAACGTGCA CCGGCGCTGT GTGCGCAGCG TGCCCTCTCT GTGCGGCGTG GACCACACGG 420 

AGCGCCGGGG CCGCCTGCAG CTGGAGATCC GGGCGCCCAC TTCCGATGAG ATCCACGTTA 480 

CGGTTGGCGA GGCCCGGAAC CTCATCCCAA TGGACCCCAA. CGGTCTCTCC GATCCCTATG 540 

TGAAGCTGAA GCTCATCCCA GACCCTCGGA ATTTGACCAA GCAGAAGACC CGCACGGTGA 600 

AAGCTACGCT AAACCCTGTG TGGAACGAGA CCTTTGTGTT CAACCTGAAG CCGGGGGACG 660 

TGGAGCGCCG GCTCAGCGTG GAGGTGTGGG ACTGGGACCG GACCTCCCGA AACGACTTCA 720 

TGGGCGCCAT GTCCTTCGGC GTCTCGGAGC TGCTCAAGGC GCCGGTGGAC GGCTGGTACA 780 

AGTTACTGAA CCAGGAGGAG GGCGAGTATT ACAATGTGCC GGTGGCTGAC GCCGACAACT 840 

GCAACCTCCT CCAGAAGTTC GAGGCCTGTA ACTACCCCCT GGAACTATAC GAGAGGGTGC 900 

GGACGGGTCC CTCTTCATCT CCCATCCCCT CCCCATCCCC CAGTCCCACC GACTCCAAGC 960 

GCTGTTTCTT CGGGGCCAGC CCTGGACGAC TGCACATCTC CGACTTCAGC TTCCTCATGG 1020 

TTCTAGGAAA AGGCAGTTTT GGGAAGGTGA TGCTGGCCGA GCGCCGGGGC TCCGATGAGC 1080 

TCTACGCCAT CAAGATCCTG AAGAAAGACG TGATCGTCCA GGATGACGAC GTGGACTGCA 1140 

CCCTGGTGGA GAAACGCGTG CTGGCTCTGG GGGGCCGAGG CCCGGGAGGC CGGCCGCACT 1200 

TCCTCACCCA GCTTCACTCC ACCTTCCAGA CCCCGGATCG CCTGTATTTT GTGATGGAGT 1260 

ATGTCACCGG GGGCGACTTG ATGTACCACA TTCAACAGCT GGGCAAGTTT AAGGAACCCC 1320 

ACGCAGCGTT CTACGCTGCA GAAATCGCCA TCGGCCTCTT CTTCCTCCAT AACCAGGGCA 1380 

TTATCTATCG GGACCTGAAA CTGGACAACG TGATGCTGGA TGCCGAAGGA CACATCAAAA 1440 

TCACCGACTT CGGCATGTGT AAGGAGAACG TCTTTCCCGG GAGTACCACT CGCACCTTCT 1500 

GCGGGACCCC GGACTACATA GCCCCCGAGA TCATTGCCTA CCAACCCTAT GGGAAGTCTG 1560 

TGGATTGGTG GTCCTTTGGG GTTCTGCTCT ACGAGATGTT GGCAGGACAG CCCCCCTTTG 1620 

ATGGAGAAGA TGAGGAGGAG CTGTTTCAAG CCATCATGGA ACAAACTGTC ACCTACCCCA 1680 

AGTCGCTTTC CCGGGAAGCT GTGGCCATCT GCAAGGGGTT CCTCACCAAG CACCCGGCCA 1740 

AGCGCCTGGG CTCAGGCCCC GATGGAGAGC CCACCATCCG CGCTCACGGC TTTTTCCGCT 1800 

GGATCGACTG GGACAGGCTG GAACGATTAG AGATCGCGCC TCCGTTCAGA CCCCGCCCGT 1860 

GTGGCCGCAG CGGCGAGAAC TTCGACAAGT TCTTCACTCG GGCGGCGCCG GCGCTGACAC 1920 

CCCCTGACCG CCTGGTTCTG GCCAGCATCG ACCAGGCTGA GTTCCAGGGC TTCACCTATG 1980 

TCAACCCGGA TTTCGTGCAC CCGGATGCCC GCAGCCCCAT CAGCCCAACG CCTGTGCCAG 2040 
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TCATGTAATC CCACCTGCCG CCACCAGGCG TCCCCACGGC TCCCTCCTCC GCCCCGGCTT 2100 

TGGCCCTCGC CTCACCATGC CACCCGCCTT TCCAATTCTA GATATGGCTC CCCAGCGTTC 2160 

TGGCCTC 2167 
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GACATCTTCC TATGATTAAC CAGGATGAGT TTAGAAACTT TTCCTATGTG TCACCGGAAT 2100 

TGCAACTGTA GCCTTATGGG GAGTCAGAAC CAAAGGGGAA GGTGGATTTC TCCAGGAATT 2160 

TCTTATGTGG GAATTC 2176 
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